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I.  INTRODUCTION 

The  research  performed  under  this  Contract  has  been  concerned  with  a 
number  of  topics  within  the  general  area  of  decision,  estimation  and  control 
for  systems  operating  in  an  environment  of  uncertainty.  Of  special  interest 
have  been  decentralized  decision  and  control  problems  that  arise  typically 
in  connection  with  large-scale  systems,  and  in  particular  with  Naval  CJ  sys¬ 
tems.  Included  here  have  been  our  development  of  decentralized  shortest 
path  algorithms  for  networks,  our  examination  of  decentralized  sequential 
detection  and  decision-making  problems,  and  our  studies  of  quantitative 
measures  of  interaction  between  system  inputs  and  outputs  (or  between 
subsystems)  and  the  implications  of  these  for  near  disturbance  localization 
and  near  decoupling  and  for  estimation  and  control  problems.  Among  the 
other  probiems'we  have  examined  are  the  use  of  the  cutoff  rate  as  a  cri¬ 
terion  for  signal  design  in  coded,  direct-detection  optical  communication,  a 
unified  treatment  of  nonlinear  smoothing  problems  with  a  vaiiety  of  state 
and  observation  models,  and  the  design  of  a  minimum-order  observer  that 
estimates  a  prescribed  multi-dimensional  linear  function  of  the  system  state. 

We  now  review  in  turn  our  research  in  each  of  these  areas.  Since 
detailed  developments  are  available  in  published  papers  or  in  drafts  submit¬ 
ted,  or  to  be  submitted,  for  publication,  we  simply  outline  the  results 
obtained  and  refer  to  the  appropriate  papers,  which  are  included  as  appen¬ 
dices,  for  more  complete  and  detailed  expositions.  The  six  major  topic  areas 
we  discuss  are: 

Optimum  Signal  Design  for  Coded,  Direct-Detection  Optical  Communi¬ 
cation  Systems. 

Informationally-Decentralized  Network  Problems. 

Decentralized  Estimation  and  Control. 

Nonlinear  Smoothing  Algorithms. 

Minimum-Order  Observers  and  State  Estimators 
Decentralized  Sequential  Detection  and  Decision  Making. 
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II,  ACCOMPLISHMENTS 


Optimum  Signal  Design  for  Coded,  Direct-Detection  Optical  Commun¬ 
ication  Systems. 

This  research  was  concerned  with  the  coordinated  design  of  the 
encoder,  optical  modulator  and  demodulator  for  a  digital  communication 
system  employing  an  optical  carrier  and  direct  detection.  The  basis  of  our 
new  approach  was  the  reformulation  of  this  signal  design  problem  to  use 
the  cutoff  rate  as  the  performance  measure  instead  of  the  usually-employed 
probability  of  error.  We  derived  the  cutoff  rate  for  a  digital  communication 
system  employing  an  optical  carrier  and  direct  detection,  and  we  used  this 
as  the  performance  measure  in  studying  the  coordinated  design  of  the  opti¬ 
cal  modulator  and  demodulator.  The  choice  of  modulation  that  maximizes 
the  cutoff  rate  was  derived  for  various  relationships  between  peak  ampli¬ 
tude  and  average  energy  constraints  on  the  transmitted  optical  signal.  In 
particular,  we  showed  that  pulse  position  modulation  is  optimum  when  the 
average  energy  constraint  is  predominant,  while  Hadamard  matrices  can  be 
used  to  define  an  optimum  choice  of  modulation  when  the  peak  amplitude 
constraint  predominates.  We  also  addressed  within  this  framework  prob¬ 
lems  of  efficient  energy  utilization,  the  choice  of  the  dimensions  of  the 
input  and  output  alphabets,  and  the  effect  of  random  detector  gain. 

The  results  of  this  extended  research  effort  are  contained  in  the  journal 
paper: 


"Some  Implications  of  the  Cutoff-Rate  Criterion  for  Coded,  Direct 
Detection  Optical  Communication  Systems",  Donald  L.  Snyder  and  Ian 
B.  Rhodes,  IEEE  Transactions  on  Information  Theory,  IT-26,  No.  3,  pp. 
327-338,  May  1980. 

These  same  ideas  and  analysis  were  also  extended  to  the  situation 
where  polarization  modulation  is  employed  in  the  optical  modulator  as  well 
as  temporal  modulation,  and  completely  analogous  results  were  shown  to 
hold.  Specifically,  for  an  input  alphabet  of  dimension  4,  the  optimal  modula¬ 
tion  when  average  signal  energy  constraints  predominate  employs  binary 
pulse-position  and  binary  polarization  modulation;  such  a  modulation 
scheme  has  in  fact  been  employed  in  gigabit-per-second  satellite  optical 
communication  systems. 

These  results  for  polarization  modulation  were  presented  in  the  confer¬ 
ence  paper: 

"Quaternary  Pulse  Modulation  is  Optimal  for  Optical  Communication  at 
One  Gigabit-per  Second",  Donald  L.  Snyder  and  Ian  B.  Rhodes,  Nationai 
Communications  Conference,  Washington,  D.C.,  November  27-29,  1979. 


Informationally-Decentralized  Network  Problems. 

This  research  effort  was  concerned  with  the  development  of 
informationally-decentralized  shortest  path  algorithms  that  enable  each 
node  in  a  network  to  find  its  shortest  distance  to  any  other  node  using  only 
local  knowledge  of  the  network  topology  and  only  local  information 
transfer  between  adjacent  nodes.  We  succeeded  in  devising  several  dynamic 
algorithms  that  accomplish  these  goals  and  can  accommodate  multiple 
changes  in  the  network,  including  not  only  branch  length  increases  and 
decreases  but  also  topological  changes  such  as  the  loss  of  branches  or 
nodes  and  the  introduction  or  reintroduction  of  branches  or  nodes.  The 
ability  of  an  algorithm  to  handle  such  changes  while  retaining  its 
informationally-  and  topologically-decentralized  character  is  essential  in 
many  practical  applications,  including  especially  those  that  arise  in  connec¬ 
tion  with  C3  systems.  Furthermore,  all  of  our  algorithms  operate  asynchro¬ 
nously  and  converge  to  the  optimum  in  finite  time. 

All  of  our  algorithms  can  be  considered  to  be  modifications  of  or  alter¬ 
natives  to  our  main  dynamic  algorithm  which  can  accommodate  all  possible 
topological  changes  in  the  network,  operates  asynchronously,  has  localized 
information  and  communication  requirements,  and  is  guaranteed  to  con¬ 
verge  in  finite  time.  These  modified  or  alternative  algorithms  retain  these 
properties  but  differ  in  their  complexity,  in  their  methods  for  handling  the 
various  topological  changes  that  can  occur,  and  in  the  specific  applications 
for  which  they  are  most  appropriate.  These  variations  result  from 
differences  between  the  several  alternative  mechanisms  we  devised  for  pro¬ 
pagating  the  effects  of  any  change  throughout  the  network;  because  the 
algorithms  are  to  operate  asynchronously  and  with  minimal  information 
transfer  between  adjacent  nodes  only,  awareness  of  the  change  and  incor¬ 
poration  of  its  effects  propagates  in  a  nondeterministic  way  through  the 
network.  In  particular,  a  means  must  be  provided  to  ensure  that  each 
affected  node  learns  of  any  change  that  has  occurred  and  subsequently 
accepts  and  propagates  further  only  distance  information  that  can  be 
guaranteed  to  fully  incorporate  the  effects  of  the  change.  Several  different 
changes  may  take  place  simultaneously,  including  branch  failures  that  may 
disrupt  the  propagation  of  the  effects  of  other  changes.  The  introduction  of 
new  branches  or  nodes  into  a  network  in  which  the  effects  of  other  changes 
may  still  be  propagating  is  an  especially  delicate  matter  that  requires 
sophisticated  control  mechanisms  if  difficulties  are  to  be  avoided  and  con 
vergence  guaranteed. 

These  algorithms  are  reported  in  detail  in  the  journal  paper: 

"Some  Shortest  Path  Algorithms  with  Decentralized  Information  and 
Communication  Requirements".  Jeffrey  M.  Abram  and  Ian  B.  Rhodes, 
IEEE  Transactions  on  Automatic  Control,  AC-27,  No.  3,  pp.  570-582. 
June  1982. 
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Some  of  these  results  were  also  given  in  the  conference  presentation: 

"Some  Informationally-Decentralized  Network  Algorithms",  Jeffrey  M. 
Abram  and  lan  B.  Rhodes,  Proceedings  of  the  1980  Joint  Automatic 
Control  Conference,  San  Francisco,  California,  August  13-15,  1980. 


Decentralized  Estimation  and  Control 

This  research  has  been  motivated  by  our  interest  in  decentralized  esti¬ 
mation,  decision  and  control  problems.  Such  problems  arise  typically  in 
connection  with  large  systems,  including  especially  C  systems.  In  other 
sections  of  this  report  we  discuss  our  development  of  decentralized  algo¬ 
rithms  for  determining  the  shortest  paths  in  a  network  and  our  investiga¬ 
tion  of  decentralized  detection  and  decision-making  problems.  In  this  sec¬ 
tion  we  review  our  examination  of  the  structural  properties  of  linear  sys¬ 
tems  and  the  implications  of  these  in  the  analysis  and  design  of  decentral¬ 
ized  estimation  and  control  algorithms. 

The  underlying  basis  of  this  research  is  our  development  of  quantita¬ 
tive  measures  of  reachability  and  observability.  This  work  was  presented  at 
the  8th  /FAC  Congress . 

"Some  Quantitative  Measures  of  Controllability  and  Observability  and 
their  Implications",  Ian  B.  Rhodes,  Proceedings  of  the  Eighth  Triennial 
World  Congress  of  the  International  Federation  of  Automatic  Control, 
Kyoto,  Japan,  August  24-28,  1981. 

and  an  earlier  version  at  the  MIT/ONR  Workshop  on  C3  Problems: 

"Recent  Results  in  Estimation  Theory",  lan  B.  Rhodes,  Third  MIT/ONR 
Workshop  on  Distributed  Information  and  Decision  Systems  Motivated 
by  Command-Control-Communication  Problems,  Washington,  D.C., 
May/June  1980. 

It  provides  measures  of  reachabilty  and  unreachability,  and  of  observability 
and  unobservability,  using  Fenchel  duality  theory:  the  usual  system- 
theoretic  dualities  are  also  preserved  by  these  measures. 

Within  this  framework  we  have  quantified  the  degree  of  interaction  or 
noninteraction  between  system  input  and  output.  The  sum  of  the  second- 
order  modes  of  the  system,  interpreted  as  a  measure  of  the  overall  strength 
of  interaction  between  system  inputs  and  outputs,  arises  naturally  when  the 
established  theories  of  completely  noninteracting  system  design  (which  are 
based  on  subspace  inclusion  relations)  are  generalized  to  design  based 
approximate  noninteraction  using  continuous-valued  measures.  In  other 
words,  this  framework  has  provided  the  setting  for  generalizing  the  body  of 


results  concerning  disturbance  rejection  and  decoupling  by  providing  a  set¬ 
ting  where  questions  of  degree  can  be  addressed  and  not  just  questions  of 
kind. 

In  this  work,  the  sum  of  the  second-order  modes  is  taken  as  the  central 
cost  functional  in  a  theory  of  near-disturbance  localization.  In  the  case  that 
exact  disturbance  localization  is  possible,  we  recover  the  results  of 
geometric  state-space  theory.  For  minimum-phase  systems  satisfying  the 
uniform  rank  condition,  the  results  we  obtain  agree  with  those  obtained 
with  the  "cheap  control"  approach  to  near  non-interaction.  In  this  case  we 
obtain  a  control  law  of  the  same  type  as  the  solutions  to  the  exact  distur¬ 
bance  localization  problem,  which  furthermore  reduces  to  a  deadbeat  con¬ 
trol  on  the  "observable  subspace".  The  optimal  feedback  gain  is  also  expres¬ 
sible  as  the  limit,  as  the  weighting  on  the  control  energy  goes  to  zero,  of  the 
solution  to  a  linear-quadratic  regulator  problem.  The  optimal  feedback 
gains  are  also  shown  to  satisfy  a  first-order  necessary  condition. 

These  results  have  been  outlined  in  the  conference  presentation: 

"Near  Disturbance  Localization  Using  Second-Order  Modes",  Joan  M. 
Saniuk  and  lan  B.  Rhodes,  Proceedings  of  the  23rd  Annual  Allerton 
Conference  on  Control,  Communications  and  Computing,  University  of 
Illinois,  Oct.  1985. 

A  full  length  journal  paper  is  in  preparation.  These  studies  have  also  led  to 
the  development  of  an  ancillary  result  which  is  presented  in: 

"A  Matrix  Inequality  Associated  with  Bounds  on  Solutions  of  Algebraic 
Riccati  and  Lyapunov  Equations",  Joan  M.  Saniuk  and  lan  B.  Rhodes, 
IEEE  Transactions  on  Automatic  Control,  AC-32,  No.  8,  Aug.  198 7,  pd. 
739-740. 


Nonlinear  Smoothing  Algorithms. 

This  research  effort  has  produced  a  unified  treatment  of  the  smoothing 
problem  for  large  classes  of  state  and  measurement  processes.  The  state 
processes  lxr}  considered  are  of  two  types:  (i)  processes  evolving  in  accor¬ 
dance  with  a  diffusion  equation  of  the  Ito  type,  and  (ii)  continuous-time, 
finite-state  Markov  processes.  The  observation  processes  considered  are 
also  of  two  types:  (i)  the  familiar  observation  model  consisting  of  a  non¬ 
linear  function  of  the  state  additively  corrupted  by  white  Gaussian  noise, 
and  (ii)  a  doubly-stochastic  counting  process,  the  rate  of  which  is  dependent 
on  the  state. 

Within  this  unified  framework,  we  have  derived  a  number  of  alternative 
representations  of  the  solution  to  the  nonlinear  smoothing  problem.  These 
include  as  special  cases  the  numerous  existing  results  tor  specific  choices  of 
state  and  observation  processes,  and  they  extend  or  generalize  these  results 
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to  all  possible  combinations  cf  state  and  measurement  processes  covered  by 
this  single,  comprehensive  formulation. 

The  alternative  representations  are: 

(1)  A  symmetric  representation  of  the  solution  to  the  smoothing  prob¬ 
lem  in  terms  of  the  solutions  to  two  filtering  problems,  one  of  which 
evolves  forwards  in  time  and  the  other  backwards  in  time.  The  first 
of  these  is  simply  the  conventional  filter  that  estimates  the  present 
state  in  terms  of  past  observations.  The  second  estimates  the 
present  state  in  terms  of  future  observations,  and  it  is  just  like  the 
first  except  that  it  operates  in  reverse  time  and  requires  models  of 
the  state  and  observation  processes  that  are  equivalent  to  the  given 
models  but  evolve  backwards  in  time.  We  provide  the  construction 
of  such  reverse-time  models  by  appropriately  generalizing  and 
extending  available  results  to  encompass  the  full  range  of  state  and 
observation  models  that  we  consider. 

(2)  An  asymmetric  representation  of  the  solution  to  the  smoothing 
problem  in  terms  of  a  conventional  filter  that  runs  forward  in  time 
and  a  likelihood  ratio  that  evolves  backwards  in  time.  Again,  the 
backwards  evolution  of  the  likelihood  ratio  is  easily  determined 
once  equivalent  reverse-time  models  for  the  state  and  observation 
processes  are  at  hand. 

(3)  A  representation  that  is  not  driven  by  the  observation  process  and 
that  expresses  the  solution  of  the  smoothing  problem  in  terms  of 
the  solution  to  the  forward  filtering  problem.  This  solution  involves 
first  processing  the  observations  using  a  conventional  forward  filter, 
and  then  generating  the  smoothed  estimates  from  these  forward- 
filtered  estimates  from  a  calculation  that  evolves  backwards  in  time 
but  does  not  require  direct  use  of  the  observations. 

These  results  have  been  published  in  the  journal  article: 

"Smoothing  Algorithms  for  Nonlinear  Finite-Dimensional  Systems", 

Brian  D.  0.  Anderson  and  lan  B.  Rhodes,  Stochastics,  Vol.  9,  1983,  pp. 

139  -  165. 

and  an  abbreviated  conference  counterpart 

"Nonlinear  Smoothing  Algorithms",  Brian.  D.  0.  Anderson  and  Ian  B. 

Rhodes,  1983  IEEE  International  Symposium  on  Information  Theory, 

Les  Arcs,  France,  June  1983. 


Minimum-Order  Observers  and  State  Estimators. 

This  research  has  addressed  a  long-standing  problem,  that  of  designing 
a  minimum-order  observer  that  asymptotically  reconstructs  a  specified 


multi-dimensional  function  of  the  state  of  the  observed  system.  Our  initial 
motivation  for  examining  this  problem  lay  in  the  application  of  such 
observers  in  the  design  and  performance  evaluation  of  decentralized  con¬ 
trollers  for  non-interacting  and  nearly  non-interacting  systems.  What 
resulted  has  been  a  contribution  to  the  fundamental  problem  of  minimum- 
order  observer  design.  We  have  developed  algorithms  for  the  design  of 
both  stable  minimum-order  observers  and  minimum-order  observers  with 
arbitrary  dynamics,  as  well  as  a  collection  of  analytical  results  concerning 
such  issues  as  uniqueness  and  bounds  on  the  dimension  of  the  observer. 
Interestingly,  the  algorithms  we  have  developed  for  the  standard  "central¬ 
ized”  minimum-order  design  problem  make  use  of  results  from  decentral¬ 
ized  control  theory. 

The  observer  design  problem  is  as  follows: 

For  the  constant  linear  system 

x(t)=Ax(t)\  x(t)eRn 

y(t)  =  Cx(t);  y(t)eRm 
design  an  observer 

z(f)  =  Fz(t)  Gy(t):  z(t)eRr 

that  asymptotically  reconstructs  the  prescribed  q-dimensional  function 
Kx(t)  of  the  state  in  the  sense  that 

Mz(f)  +  Ny(t)  — -  Kx(t)  as  (  — •  cc 

The  entities  to  be  chosen  are  the  dimension  r  of  the  observer  and  the 
matrices  F.  G,  M  and  N.  The  objective  is  for  r  to  minimum  consistent  with 
either  (i)  F  is  stable,  or  (ii)  the  eigenvalues  of  F  are  assignable  arbitrarily.  In 
the  former  case  we  require  simply  that  the  observer  be  stable,  while  in  the 
latter  we  incorporate  complete  flexibility  in  the  choice  of  the  observer 
dynamics. 

These  two  observer  design  problems  can  be  expressed  in  the  frame¬ 
work  of  geometric  linear  system  theory:  the  first  problem  becomes  one  of 
finding  a  minimal  (/V,C')-invariant  subspace  r  that,  together  with  the  range 
of  C',  contains  the  range  of  K'  and,  for  some  L  such  that  (A+LCYrCr, 
(A+LCY\r  is  stable.  In  the  language  of  geometric  linear  system  theory,  r  is  a 
minimal  stable  cover  of  the  range  of  K'.  The  second  problem  becomes  one 
of  finding  a  minimal  (A',C')-controllability  subspace  r  that,  together  with  the 
range  of  C,  contains  the  range  of  K'.  in  this  case,  we  seek  a  minimum  cover 
that  is  also  a  controllability  subspace.  Thus,  results  for  minimum-order 
observer  design  have  counterparts  in  the  geometric  theory  of  linear  sys¬ 
tems.  and  vice-versa. 

The  algorithm  we  have  developed  for  constructing  a  minimal  stable 
observer  (cover)  makes  use  of  standard  results  for  (n-m)-dimensional 
reduced-order  observers.  The  idea  underlying  our  approach  is  as  follows: 
we  begin  by  temporarily  augmenting  the  available  output  y  with  certain 
artificial  outputs,  thus  creating  an  augmented  output  ya  with  dimension  m( 
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that  is  used  to  drive  a  standard  (n-ma)-dimensional  reduced-order 
observer.  This  reduced-order  observer  is  then  modified,  keeping  its  dimen¬ 
sion  fixed,  by  removing  the  artificial  outputs  that  were  temporarily  added 
until  only  the  available  output  y  remains  driving  it.  The  removal  of  each 
artificial  output  results  in  constraints  on  the  observer  dynamics  that  reflect 
the  requirement  that  the  prescribed  linear  function  Kx(t)  be  asymptotically 
reconstructible  from  y  and  the  state  of  the  modified  observer.  The  result  is 
a  characterization  of  all  covers  (or,  equivalently,  of  all  not  necessarily  stable 
"observers"  of  dimension  n— ma)  that  meet  the  requirements  on  estimating 
Kx.  If  there  is  a  stable  observer  amongst  these  then  no  more  need  be  done. 
If  not,  the  number  of  artificially-added  outputs  is  reduced  by  one,  thus 
increasing  the  dimension  ( n-ma )  of  the  observer  by  one,  and  the  p.ocess 
repeated.  The  observer  dimension  that  is  used  initially  in  this  iterative  pro¬ 
cess  is  provided  by  a  new  lower  bound  that  we  have  derived  on  the  dimen¬ 
sion  of  the  minimum  stable  observer,  while  a  known  upper  bound  limits  the 
number  of  iterations  that  are  needed  before  a  stable  observer  is  found.  We 
have  also  obtained  new  results  on  the  uniqueness  of  the  minimal  cover  and 
of  the  F  matrix  (modulo  a  similarity  transformation)  in  the  minimal 
observer. 

The  above  characterization  of  all  covers  of  a  given  dimension  that  meet 
the  requirements  on  estimating  Kx  also  provides  the  basis  for  our  solution 
of  the  second  problem:  we  seek  from  among  these  one  that  is  also  a  control¬ 
lability  subspace.  We  do  this  by  reformulating  the  problem  in  such  a  way 
that  we  can  bring  to  bear  a  result  of  Corfmat  and  Morse  that  pertains  to  the 
decentralized  control  of  linear  systems.  In  fact,  the  checking  of  the 
Corfmat-Morse  conditions  in  this  case  can  be  simplified  considerably  by 
exploiting  the  particular  structure  that  arises  here.  Again,  the  overall  pro¬ 
cess  is  an  iterative  one:  if  no  controllability  subspace  can  be  found  from 
among  the  covers  of  a  given  dimension,  then  none  exists  and  the  dimension 
is  increased  by  one  and  the  process  is  repeated.  We  have  derived  lower  and 
upper  bounds  on  the  dimension  of  the  smallest  controllability  subspace  that 
covers  the  range  of  K'\  this  provides  an  initial  observer  dimension  for  this 
iterative  process  and  an  upper  limit  on  the  number  of  iterations  before  a 
solution  is  found. 

The  results  on  the  uniqueness  of  the  minimal  cover  are  available  in  a 
conference  proceedings: 

"Uniqueness  of  the  Minimal  Dynamic  Cover  and  the  Associated  Solu 
tion  to  Sylvester’s  Equation",  Amr  F.  Assal  and  Ian  B.  Rhodes.  Proceed¬ 
ings  of  the  24th  Annual  Allerton  Conference  on  Control,  Communica¬ 
tions  and  Computing,  University  of  Illinois.  Oct.  1986. 

Two  journal  papers  describing  the  results  of  the  major  part  of  this 
research  effort  are  currently  in  preparation. 


Decentralized  Sequential  Detection  and  Decision  Making. 

Problems  of  decision  making  in  the  face  of  statistical  uncertainties  have 
long  been  of  interest  to  decision  theorists  in  various  disciplines.  Detection 
theory  is  one  area  of  decision  making  that  has  received  much  attention,  par¬ 
ticularly  in  surveillance  systems.  The  well-known  theory  of  classical  detec¬ 
tion  has  been  successfully  applied  to  single-sensor  detection  problems  and 
to  multiple-sensor  problems  when  all  collected  data  is  transmitted  to  a  cen¬ 
tral  site  for  processing.  In  those  cases  where  it  is  impractical  or  infeasible 
for  all  the  raw  data  collected  at  the  local  sites  to  be  transmitted  to  a  central 
processing  center,  a  decentralized  (or  distributed)  detection  problem  results 
in  which  each  of  the  local  sites  is  asked  to  perform  some  preprocessing  of 
its  own  raw  data  before  communicating  with  the  fusion  center.  Although 
suboptimal  compared  with  centralized  detection  because  of  the  loss  of 
information  in  local  processors,  decentralized  detection  is  in  many  cases  a 
more  realistic  formulation  in  practical  applications  than  its  centralized  coun¬ 
terpart.  For  example,  in  the  face  of  capacity-constrained  channels,  local  pro¬ 
cessing  could  substantially  decrease  the  communication  bandwidth  to  the 
central  site,  thereby  speeding  up  the  process  and  reducing  the  communica¬ 
tion  costs.  Also,  decentralization  may  be  the  only  natural  way  to  model  the 
problem  in  situations  where  multiple  detectors  of  various  types  are  located 
at  dispersed  geographical  sites.  Furthermore,  decentralized  detection  may 
be  imposed  in  situations  where,  due  to  enormous  amounts  of  available  raw 
data,  centralized  processing  of  the  information  is  not  feasible.  Other  issues 
include  potential  system  reliability  and  integrity  in  the  face  of  failures. 

There  are  a  variety  of  ways  in  which  one  can  pose  a  decentralized 
detecton  problem.  Each  formulation  has  elements  that  capture  one  or  more 
of  the  following  three  features  of  decentralized  detection  and  decision¬ 
making  problems: 

Hierarchical  Structure:  Local  observations  are  processed  and  the  result 
sent  to  a  fusion  center  where  a  final  or  “global”  decision  is  made. 
There  may  or  may  not  be  any  direct  communication  between  the  local 
observers. 

Multi-Stage  or  Sequential  Structure:  Additional  measurements  can  be 
taken  by  the  local  observers.  Each  local  observer  might  wait  until  he 
elects  to  stop  taking  measurements  before  sending  information  to  the 
fusion  center,  or  information  might  be  sent  after  each  observation  and 
the  fusion  center  given  the  flexibility  to  determine  when  to  declare  a 
decision. 

Information  Rate  or  Bandwidth  Reduction:  The  preprocessing  per¬ 
formed  by  each  local  observer  should  result  in  a  significant  reduction 
in  the  amount  of  data  sent  to  the  fusion  center.  An  extreme  example 
is  for  each  local  site  to  make  a  "local  decision"  and  to  communicate 
this  to  the  fusion  center. 

We  have  analyzed  a  broad  class  of  multi-stage  decentralized  detection 
and  decision-making  problems  that  captures  for  the  first  time  all  three  of 
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the  above  elements.  The  most  general  of  these  is  a  multi-stage,  multi¬ 
detector  decentralized  binary  hypothesis-testing  problem  in  which  each 
detector,  after  obtaining  each  of  his  observations,  sends  a  binary  decision  (0 
or  1)  to  the  fusion  center  or  "supervisor",  who  is  given  the  option  of  declar¬ 
ing  a  final  decision  (as  to  which  of  two  possible  hypotheses  is  true)  at  any 
stage  depending  on  the  local  decisions  that  have  been  received.  There  is  a 
cost  associated  with  each  combination  of  the  true  hypothesis  and  the  final 
decision.  In  addition,  a  cost  is  incurred  s  delayed  until  the  next  time  instant. 
There  is  no  communication  among  the  local  detectors. 

We  have  shown  that,  under  appropriate  independence  assumptions  that 
are  in  many  practical  applications  quite  reasonable,  the  optimal  local  stra¬ 
tegies  are  governed  by  threshold  tests  on  the  likelihood  ratio  (i.e., 
likelihood-ratio  tests),  where  the  thresholds  are  precomputable  off-line. 
This  is  an  important  result  since  it  makes  tractable  an  otherwise  intractable 
problem.  Indeed,  the  most  general  problem  is  known  to  be  NP-complete, 
and  the  assumption  that  the  local  observations  are  conditionally  indepen¬ 
dent  given  the  hypotheses  is  crucial  in  providing  the  simplification  of  the 
optimal  strategies  to  likelihood  ratio  tests. 

These  results  are  presented  in  two  publications.  The  first, 

"Decentralized  Sequential  Detection",  H.  R.  Hashemi  and  Ian  B.  Rhodes, 

IEEE  Transactions  on  Information  Theory,  to  appear. 

develops  the  results  for  the  two-detector,  two-stage  problem,  along  with 
some  of  the  important  properties  of  the  optimal  strategies  that  suggest  a 
significant  reduction  in  the  computations  required  to  determine  the  solu¬ 
tion.  The  second, 

"Decentralized  Dynamic  Decision  Making”,  H.  R.  Hashemi  and  Ian  B. 

Rhodes,  Proceedings  of  the  1987  IEEE  Conference  on  Decision  and 

Control,  Los  Angeles,  1987,  pp.  1836-1841  (Invited  Paper). 

contains  the  results  for  the  general  multi-detector,  multi-stage  problem. 
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Quaternary  Pulse  Modulation  Is  Optimal  for  Optical  Conrnunication 
at  One  Gigabit  per  Second* 


Donald  L.  Snyder** 
Ian  B.  Rhodes*** 


SUMMARY 

The  received  data  in  an  optical  communication  system  employing  ideal  direct 
detection  is  modeled  by  a  Poisson  random  point  process,  which  accounts  for  both 
the  quantum  nature  of  light  and  the  optical-to-electrical  energy  conversion  process 
in  the  detector.  Using  this  model,  we  have  evaluated  the  computational  cutoff  rate 
of  the  discrete  channel  formed  by  an  encoder,  optical  modulator-channel-demodulator, 
and  decoder.  We  find  that  quaternary  pulse  modulation,  a  4-ary  scheme  employing 
binary  pulse-position  and  binary  polarization  modulation,  maximizes  this  cutoff 
rate  when  the  optical  field  produced  by  the  modulator  is  subject  to  peak-value  and 
average-energy  constraints.  Such  a  modulation  format  has  been  adopted  in  a  one 
gigabit  per  second  satellite  optical  communication  system  under  development  for 
the  Air  Force. 
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“Some  Implications  of  the  Cutoff-Rate  Criterion  for  Coded  Direct- 
Detection  Optical  Communication  Systems  ”,  Donald  L.  Snyder  and  Ian 
B.  Rhodes,  IEEE  Transactions  on  Information  Theory,  IT-26,  No.  3,  pp. 
327-338,  May  1980. 
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Some  Implications  of  the  Cutoff-Rate 
Criterion  for  Coded  Direct-Detection 
Optical  Communication  Systems 

DONALD  L.  SNYDER,  senior  member,  ieee,  and  FAN  B.  RHODES,  member,  ieee 
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aad  the  effect  of  laadote  detector  ph  are  addressed. 


I.  Introduction 

OUR  CONCERN  in  this  paper  is  with  digital  com¬ 
munication  systems  that  employ  coherent  light  as  a 
carrier  and  direct  detection  as  the  means  to  convert  the 
received  optical  field  into  an  electrical  signal  for  subse¬ 
quent  processing.  Communication  systems  of  this  type  are 
discussed  widely  in  the  literature  (see  [l]-[5]  and  refer¬ 
ences  therein)  and  are  of  increasing  importance  in  ap¬ 
plications.  The  optical  portion  of  the  overall  system  con¬ 
sists  of  the  optical  modulator,  optical  channel,  and  optical 
detector  shown  schematically  in  the  basic  infor¬ 
mation-theoretic  model  of  the  optical  digital  communica¬ 
tion  system  of  Fig.  1.  Here,  E(t,F)  represents  the  tempor¬ 
ally  and  spatially  dependent  complex  envelope  of  the 
optical  field,  and  N(t)  represents  the  counting  process 
associated  with  the  output  of  an  ideal  direct-detection 
device.  This  counting  process  is  assumed  to  be  an  inho¬ 
mogeneous  Poisson  process  with  rate  function  X(f)»j(r) 
+  where  represents  the  contribution  to  the  total 
count  rate  due  to  dark  current  in  the  detector.  Also, 
can  account  for  background  radiation  when  this  is  char- 
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Fig.  1.  Optical  digital  communication  system. 


actenzed  by  many  weak  modal-components  [2],  [3],  The 
assumption  that  N(t )  is  a  Poisson  process  is  met  to  a  close 
approximation  on  the  free-space  channel  for  coherent 
sources  [3].  On  our  model  the  signal  count  rate  r(f)  is 
related  to  E(t,r)  according  to 


s{t)-{T)/hv)j \E(t.r)\tdr,  (1) 

where  ij  is  the  quantum  efficiency  of  the  detector,  h  is 
Planck’s  constanL  »  is  the  optical-earner  frequency,  and  A 
is  the  active  surface  of  the  detector:  it  is  evident  that  s(D 
is  nonnegative,  which,  of  course,  it  must  be  as  a  rate 
function. 

We  shall  suppose  that  a  code  letter  x  in  Fig.  I  is  drawn 
once  each  T  seconds  from  a  <7 ary  alphabet  X  — 
{Xt,X2,  -  ■  ■  ,X  }.  We  further  suppose  that  each  demodula¬ 
tor-output  letter  y  is  drawn  from  a  <7  ary  alphabet  X  - 
{  V,,  Y2,-  ■  ■ ,  Yj),  where  in  general  q'  >q.  Initially,  we  in¬ 
vestigate  “infinitely  soft”  decisions  for  which  q  —  00:  then 
we  study  the  penalty  for  choosing  a  smaller  value  of  q . 
The  decoder  output  letters  u  supplied  to  the  sink  are 
reproductions  of  the  encoder  input  letters  u  supplied  by 
the  source;  these  are  presumed  to  be  drawn  from  a  binary 
alphabet  %«{0, 1}.  The  rate  of  the  coding  system  in 
terms  of  the  number  of  source  digits  for  each  channel 
letter  will  be  denoted  by  R  bits  per  channel  use.  This 
means  that  R  —  R,  T  if  the  source  generates  Rt  bits  per 
second. 

The  combination  of  the  optical  modulator,  optical 
channel,  optical  detector,  and  demodulator  forms  a  dis¬ 
crete  channel  with  a  qury  input  alphabet  X  and  <7  ary 
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output  alphabet  .  By  virtue  of  the  independent-incre¬ 
ments  property  of  the  Poisson  process  and  the  constancy 
of  Xg,  this  is  a  “constant  discrete  memoryless  channel”  in 
the  sense  that  the  conditional  probability  the  channel 
output  sequence  is  -  ■  ■  b„,  where  each  b,  is  in  ul) ,  given 
that  the  input  sequence  is  •  •  an,  where  each  a,  is  in 
%,  factors  into  the  n-fold  product  of  the  per-letter  transi¬ 
tion  probabilities  according  to 

•M-Vi  “  ^.y^b*-  -,ym- hjx,  -  a,,  x2»  a2,  •••,*„- a„) 

n 

-  II  Pr[>v-Mx(-u,].  (2) 

»■  1 

Furthermore,  the  per-letter  transition  probabilities  are  the 
same  for  any  T  second  use  of  the  channel.  Thus,  if 
py[,(Y\X)  denotes  the  per  letter  transition  probability,  the 
right  side  of  (2)  is  fl". \Py\.,(b,\a,)-  The  design  of  the 
modulator  and  demodulator,  of  course,  affects  p  lr(  Y\X). 
We  shall  study  the  design  which  makes  pyU(  Y\X)  most 
favorable  for  a  given  optical  channel  and  detector.  The 
coordination  of  this  design  with  that  of  the  encoder  will 
also  be  studied. 

A  quantity  that  reflects  the  influence  of  py{]t(  Y\X)  on 
the  quality  of  a  constant  discrete  memoryless  channel  is 
the  cutoff  rate  R0  defined  by 

*o- -log,/ nun  2  f  2  (P»JY\X))'nQ(X)t\ 

l  0  re-9  L  *€'X  J  J 

--log,  min  f  2  Q(X,)Q{*,) 

{  Q  *- i y— » 

■  W>),/Jj-  (3) 

where  Q  is  a  probability  mass  function  on  Wozencraft 
and  Kennedy  [6],  in  1966,  were  first  to  argue  in  favor  of 
the  cutoff  rate  as  a  criterion  for  design  because  it  is  the 
upper  limit  of  code  rates  R  for  which  the  average  decod¬ 
ing  computation  per  source  digit  is  finite  when  sequential 
decoding  is  used.  Wozencraft  and  Kennedy  also  showed 
that  there  is  a  block  code  of  rate  R  and  codeword  length 
N  such  that  the  probability  of  error  Pr(e)  in  decoding  a 
sourceword  of  length  K  —  NR  is  bounded  according  to 

Pr(e)<2-w"'>-*),  if  R<Rq.  (4) 

Thus,  for  block  codes,  the  single  number  Rq  provides  a 
measure  of  both  a  range  of  rates  R  for  which  reliable 
communication  is  possible  as  well  as  the  coding  complex¬ 
ity,  as  reflected  by  ,V,  required  to  guarantee  a  specified 
block  error  probability.  More  recently,  Viterbi  [7]  has 
shown  for  convolutional  coding  and  maximum-likelihood 
sequence  decoding  on  the  constant  discrete  memoryless 
channel  that  the  error  probability  is  upper  bounded 
according  to 

Pr<e)<C„L2-  v*'>  if  R<R*  (5) 

where  N  is  the  constraint  length  of  the  convolutional 
code.  R  is  the  code  rate.  L  is  the  total  number  of  source 

-- 1  f~  .«■  *  nt  funrtirm  r>f 


R  and  not  a  function  of  L  and  N.  Thus,  as  with  block 
:odes,  the  single  number  R0  provides  a  measure  of  both 
-•eliable  rates  and  code  complexity.  Massey  [8],  [9]  made 
these  observations  first  and  has  used  them  to  make  an 
eloquent  and  persuasive  argument  for  adopting  R0  as  a 
modulator -demodulator  design  parameter  in  place  of  the 
more  commonly  used  error  probability.  In  what  follows 
we  shall  investigate  some  of  the  implications  of  attempting 
to  maximize  this  parameter  for  modulator-demodulator 
design  for  direct-detection  optical  communication  sys¬ 
tems. 

II.  R0  FOR  iNFtNTTF.LY  FINE  QUANTIZATION 

In  practice,  the  demodulator  of  Fig.  1  must  quantize  the 
point  process  observed  on  [0,  7"]  in  some  fashion  to  pro¬ 
duce  one  of  the  q'  output  letters  in  ^ .  This  might  be 
accomplished,  for  example,  by  counting  points  in  subin¬ 
tervals  of  [0,  T],  disregarding  their  times  of  occurrence 
within  these  subintervals.  and  then  comparing  the  subin¬ 
terval  counts  to  prescribed  thresholds.  Regardless  of  what 
form  of  quantization  is  adopted,  the  finer  it  is.  the  larger 
will  be  the  cutoff  rate  R0  of  the  resulting  constant  discrete 
memoryless  channel.  Thus  we  consider  first  the  limiting 
situation  of  infinitely  fine  quantization,  for  which  q ’  —  xs 
and  *0^*0. „  is  not  degraded  by  quantization.  Then  we 
consider  the  effect  of  finite  quantization. 

For  a  Poisson  process  with  rate  \(t).  the  probability  of 
observing  n  points  during  [0.  T ]  in  n  disjoint  intervals 
[/,,/, +  Ar,),(/2,/2  +  Ar2).  +Ar„)  is  approximated  to 

o(max,  At,)  by 

II  X(t,)jexp(  -  j  X(t)dt)At,At2  At, 
for  n  >  1  and  by 

e*p(  ~ 

for  a- 0.  Consequently,  for  infinitely  fine  quantization, 
the  summation,  call  it  f(i,j).  over  k  in  (3)  becomes 

/(i,y)-exp(-  i^r(\(f)  +  X,(f))dfj 

i+£  //•■■/  n , 

—  I  J  J  J  /-I 

where  \(f)  and  \,(t)  are  the  detection  rates  for  code  letters 
X ,  and  Xj,  respectively,  and  the  integration  is  over  the 
region  0  < f ,  < t2  <  -  -  -  <t,  <  T.  By  extending  this  range  of 
integration  to  0<t,  <T  for  /—  1,2.  ■  •  -  ,#i,  and  dividing  by 
n\  to  compensate  for  this  extension,  we  obtain 

/( i.j)  -  exp(  “  |  fo  ^  &( n  ~  g,( Of dt ). 
where  we  define  g,(/)-\l/J(/)  and  g,(r)-\l/2(ri.  Thus 

*<>.»-  "log:  min  2  V  Q(X,)Q{X,) 

\  <?  -i,-i 

cxp(- 


(6) 
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This  expression  is  identical  to  that  obtained  by  Massey 
(8.  eq.  (4)]  if  the  signal  g,(/)  were  to  be  observed  in  an 
additive  white  Gaussian  noise  of  unit  intensity  when  X,  is 
the  code  letter  into  the  modulator.  It  is  with  this  expres¬ 
sion  that  Massey  established  for  the  first  time  the  ,,,- 
optimality  under  an  average  energy  constraint  of  a  sim¬ 
plex  signal  set  for  the  additive  white  Gaussian  noise 
channel.  However,  the  additional  constraint  g,(r)>\o/2> 
0  obtains  here,  so  Massey's  argument  does  not  hold  for 
direct-detection  optical  communication  systems  and  must 
be  modified.  This  is  accomplished  as  follows. 

By  defining 

i  Q2(X.) 

I  - 1 

and  by  using  Jensen's  inequality.  Massey  [8]  shows  from 
(6)  that 

*0.„<  ->°S2  j 

1  <7 

■net 

with  equality  holding  if  and  only  if  the  quantity 

fr(g,(n-g/o)2*  (8) 

Jo 

is  the  same  whenever  /  *j.  It  is  evident  from  (7)  that  f?aa0 
is  a  monotomcally  increasing  function  of  d{j  for  / =*/  Thus, 
if  d  denotes  the  maximum  of  the  dv  for  i  #/, 


S  +  (\  -  S  )exp 


i 


1 


2(1-5) 


OQ(XJ)fgr(gl0)-g/i))^rJ 


(V 


*0  ®  <  “1o82  nun 

Q 


5  +  (l-5)exp(-idJ)lj.  (9) 


Furthermore,  it  is  easily  verified  that  the  minimizing  code 
letter  distribution  in  (9)  is  the  uniform  distribution  Q(X,) 
-  I  /  q  for  i  -  1.2.  ••  .q.  As  5*«  \/q  for  this  distribution. 
(9)  becomes 


*o.»<1°g2‘7-log2 


1  + 


(q-  Dexpj 


with  equality  holding  if  and  only  if  d0~d 


}d2)],  (10) 

whenever  is*/. 


III.  Modulator  Design  Based  on  R0  x 

An  optical  modulator  designed  to  produce  a  signal  set 
£  -  (£i(f,r),£2(f,r),  ■  ■  ,£,(f.r)}  such  that  for  is*j 

and  such  that  d  is  as  large  as  possible  produces  the  best 
overall  performance  for  the  digital  optical  communication 
system  as  measured  by  Thus  we  are  motivated  to 

examine  the  maximuation  of  d  subject  to  suitable  con¬ 
straints  on  signals  in  5 .  Associated  with  each  signal  set  § 
is  a  derived  signal  set  3  -  { gi(f).g2(/),-  •  •  ,gy(r)}  in  which 
g,(f)-\'/J(t).  where 

\(t)-(-q/h*)(  |£,(r.F)|2dr  +  V  (11) 

J  A 

Note  that  signals  in  <2  satisfy  g,(t)  >gmmm\j/2- 

This  maximization  problem  is  examined  subject  to 
additional  constraints  on  the  average  energy  and  the  peak 


amplitude  of  signals  in  the  transmitted  signal  set  § .  We 
assume  that  the  average  energy  £  of  signals  in  > .  defined 
by 

£--2  frf  \E,(t.F)\2dFdt.  (12) 

9  ,  - 1  -'o  J a 

must  satisfy 

(13) 

where  Emtx  is  a  prespecified  maximum  allowable  average 
energy.  Then  the  average  energy  Eg  for  signals  in  the 
derived  signal  set  £ .  defined  by 

£.--2  (rg2d)dt.  (14) 

9  ,  - 1  J0 

satisfies 

E'-n-sZs^,  (15) 

where  s~qE/hv  and  H-g^T-X^T  are  the  average 
number  of  signal  counts  and  noise_counts,  respectively, 
per  channel  use.  and  where  s^^qE^/ hv.  We  assume, 
further,  that  the  amplitude  jE,(t,F)l  of  each  signal  in  the 
transmitted-signal  set  S  cannot  exceed  a  prespecified 
maximum  value  Pmtx:  that  is. 

|£.(/.')I<',,m,  06) 

for  1,2, •  •  •  ,<f,  0 <t  <T,  and  for  all  locations  F  in  the 
active  surface  of  the  detector.  Then  each  signal  in  the 
derived  signal  set  9  satisfies 

gmn<g,(‘)<gmt*'  (17) 

where  gmla-\J/2  and  gm„  -  [(tj/I  /  hr)  P^  +  V  n. 

For  modulator  design,  we  thus  have  the  following  opti¬ 
mization  problem.  Select  signals  in  §  to  maximize 

dl- [9(9- I)]"'  2  2  fr[g,(0-g,i')]2dr  OS) 

1  —  1 ,  —  1 

subject  to  the  following  constraints. 

i)  Equidistance  constraint:  The  quantities  d:J  m  (8) 
should  be  the  same  whenever  is*/. 

ii)  Average-energy  constraint :  Equation  (15)  should  be 
satisfied. 

iii)  Peak-amplitude  constraint:  Equation  (17)  should  be 
satisfied. 

To  simplify  the  development  we  temporarily  neglect  the 
equidistance  constraint  in  formulating  and  solving  this 
optimization  problem.  It  will  be  evident  subsequently  that 
among  the  solutions  to  the  relaxed  problem  are  ones 
satisfying  the  equidistance  constraint,  and  these  are  then 
solutions  to  the  fully  constrained  problem. 

We  find  for  </'— 00  that  the  best  choice  of  modulator 
design  depends  on  the  particular  values  of  q.  T.  gmjB.  gm„, 
and  but  whatever  values  these  parameters  may  be. 
there  are  only  three  categories  of  best  design.  These  are 
determined  by  the  conditions 


0  .-(g2m„-gLn)T 


(19a) 


330 


IEEE  TRANSACTIONS  ON  INFORMATION  THEORY.  VOL.  IT-26.  NO.  3.  MAY  1980 


|  -  (  gfliAx  gnuo)  T *  2  (  Srrux  gmin)  9 

/  —  (  g:  -g2  )rlzi(„2  -g2  )T\ 

^  q  V  <5mu  omio/  *  -y  ^  V  O  mix  s.nin/  J' 


where  the  inequality  follows  by  virtue  of  (21),  and  equality 
even'  hoids  if  and  only  if  g,(/)  =  gm„  or  g,(/)-gram.  Finally, 
substituting  (22)  into  (20),  using  the  average-energy  con¬ 
straint  (15),  and  noting  the  conditions  for  equality  yields 
the  following  lemma. 


odd. 

(19b) 

Lemma  1:  Let 

be  the  maximum  average 

signal 

‘  J  \ 

counts  per  channel  use 

,  and  suppose  gram  <  g,(t)  <  g, 

for 

jfsLj.-Smm)7'-00)' 

q  even, 

/f=[0,Tl  and  i€ 0,2.-  ■ 

*  tq.  Then 

[*  Q  -  I  \ 

(19c) 

d2<  2 

S  CTULX  &  min  }  - 

(23) 

- (if-  —  )  7*.  00  1 , 

2q  V  Smu  omin/  1  *  J 

q  odd. 

g  +  „  n»“- 

OtnMx  o  nun  j 

We  say  that  the  “average-energy  constraint  predominates” 
when  (19a)  holds,  the  “peak-amplitude  constraint  pre¬ 
dominates”  when  (19c)  holds,  and  that  “neither  constraint 
predominates”  when  (19b)  holds. 

Average- Energy  Constraint  Predominates 


Furthermore,  equality  holds  if  and  only  if  both  a)  at  any 
time  /  e[0,  T],  all  signals  in  £  take  on  value  gnun  except  at 
most  one  which  takes  on  value  g^;  and  b) 

(24) 


2  /V(')‘*-gLn7'= 

9  ,  - 1  Jo 


We  first  give  an  upper  bound  on  d 2  that  holds  regard¬ 
less  of  which,  if  any,  constraint  predominates.  Then  we 
identify  a  modulator  design  that  achieves  this  upper 
bound  when  the  average-energy  constraint  predominates. 

Suppressing  the  common  argument  t  of  all  entities,  we 
have 

(  g,  gj )  —  (  gi  gmin)  2(  g(  gnun)(  ^  gnun)  (  gy  gmm) 
4  (  g,  g min )  "^(gj  gnun)  ’ 

the  inequality  holding  because  ( g,  -  gmB)  >0  for  all  /  € 

{ 1.2.  ■  •  •  ,q).  Furthermore,  for  i  =*/,  equality  holds  if  and 
only  if  at  most  one  of  g,  and  g,  is  strictly  greater  than  g„.„. 
Summing  over  t¥>j,  then  over y  €{l,2,-  •  ,q),  integrating 
over  [0.  TJ  and  dividing  both  sides  by  q(q—  1)  yields 


The  condition  a)  for  equality  is  simply  a  combination  of 
the  conditions  for  equality  of  (20)  and  (22).  while  condi¬ 
tion  b)  is  the  condition  for  equality  in  (15). 

A  signal  set  that  is  equidistant  and  achieves  the  upper 
bound  in  Lemma  1  with  equality,  and  which  therefore 
maximizes  the  cutoff  rate  x  when  the  average-energy 
constraint  predominates,  is  characterized  in  the  following 
lemma. 

Lemma  2:  If  im„  satisfies  (19a),  equality  is  achieved  in 
(23)  by  the  equidistant  pulse-position  modulation  (PPM) 
signal  set 

,  (  g«„.  (i-l)T/q<t<(i-l  +  e)T/q, 

8>  ‘  [  gmm*  otherwise  for  0  <  t  <  T. 

(25) 


~fri  s  [*«>-*«>]'* 

~  1  *  •'0  ,-  |  y-  1 

<^fri{g,(0-gmo]1dt.  (20) 

9  Jo  ,  -  i 


9<9 


with  equality  holding  if  and  only  if.  for  almost  all  t, 
g.( ,)>gmm  for  at  most  one  value  of  /  in  {1.2.---  ,q).  Now 
for  any  ie(1.2,  •  .q)  and  any  /e[0.7], 

[$,(') -gmm][gm«-g,(')]>0. 

which  yields 

Si  S  (rum  &min  ]  ^  ^mu  %  min*  (21) 

with  equality  if  and  only  if  g,(t)  -gmtn  or  We 

then  have 


L  Si(  1  )  ] 


S  mu  S\ 


0  ~  +  Snun)&(  ')  ] 


+  I  -  ' 


2Sn 


S mu  &tt 


2Smitl 

S  mu  %  mm 

j”  S  max  ^ min 

[  £m«*  £m»n 


(  ^rau  %  mm  ^ 


g,J(0  +  g^u 

8  m**  &mjn 


&  max 


m*x  <5  min 


g.2(0  +  gi 


[  g,J(')-g^n]. 


(22) 


where 


i  ■ 


(26) 


To  establish  Lemma  2.  note  that  the  PPM  signal  set  is 
clearly  equidistant  and  that  condition  (19a)  is  equivalent 
to  <  <  1  so  that  condition  a)  in  Lemma  1  is  satisfied. 
Moreover  the  average  number  of  signal  counts  per  chan¬ 
nel  use  for  the  PPM  signal  set  is  given  by 


f-ll  fr8*2U)dt-g2auBT^  ~(g2mkx~g, 
9  ,-i  Jo  9 


1  ) 

min/ 


Hence,  from  (26),  i*  — iraiu,  so  condition  b)  of  Lemma  I  is 
also  satisfied.  Consequently,  by  Lemma  1.  d2  for  this 
signal  set  equals  the  upper  bound  in  (23).  It  is  straight¬ 
forward  to  verify  by  direct  calculation  for  this  PPM  signal 
set  and  e  <,  I  that  d2  equals  the  upper  bound.  Lemma  2 
follows,  and  we  conclude  that  the  equidistant  PPM  signal 
set  (25)  maximizes  R0  ae  when  the  average-energy  con¬ 
straint  predominates. 

Lemma  2  can  be  strengthened  by  noting  that  the 
equidistant  PPM  signal  set  (25)  is  the  unique  signal  set 
that  achieves  equality  in  (23)  modulo  shifting  or  splitting 
of  pulses  while  keeptng  them  nonoverlapptng  and  keeping 
the  total  "on-time”  of  any  g,  equal  to  iT / q.  This  is 
because  condition  a)  of  Lemma  1  is  satisfied  if  and  only  if 
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the  pulses  are  nonoverlapping  and  because  <  <  1  is  chosen 
precisely  to  use  up  all  the  available  energy,  as  required  by 
condition  b)  of  Lemma  1. 

Peak-Amplitude  Constraint  Predominates 

By  this  we  mean  that  the  energy  constraint  ( 1 5)  is  not  a 
limiting  consideration.  We  therefore  neglect  it,  as  well  as 
the  equidistance  constraint,  and  consider  the  problem  of 
maximizing  d2  in  (18)  subject  only  to  (17).  The  average 
energy  required  by  the  signals  that  solve  this  problem  will 
then  provide  conditions  for  dominance  of  the  peak-ampli¬ 
tude  constraint,  and  among  the  solutions  to  the  relaxed 
problem  are  ones  satisfying  the  equidistance  constraint. 
These  are  then  solutions  to  the  fully  constrained  problem. 

In  the  Appendix,  we  derive  the  following  upper  bounds 
on  d2: 

\  <?(<?- ir'CSmu-gmJ2?'-  <7  even,  (27a) 

<7  odd.  (27b) 

An  alternative  and  simpler  derivation  for  q  even  is  as 
follows.  For  any  choice  of  gs, 

( g,  ~  gj)1  -  ( &  "  g,f  ~  2(  g,  ~  g,)(  gj  ~g,)  +  (gj~  g,)1* 

so  that  summing  over  i  and  j  yields 

2  2  u,-gy)J*2<7  2  (g.-g,)2- V(c~g,)2, 

.-i 

where  c  is  the  centroid  c  ~  q~  ,  g,.  Thus,  from  (18), 

d2-Uq-l)~'  fr£  [g,(t)-g,(‘)]2dt 
J0  1-1 

-2<7(<7-l)'1  fr[c(/)-,gt(r)]2<a 
-'0 

q 

<2(<?-D"fr2  [&(')-*, (')]*<*.  (28) 

j0  ,m  1 

with  equality  holding  if  and  only  if  c(/)»g,(/)  for  almost 
all  t  e(0,  7*1.  Takrng  implies  |  g,(f)  — 

g,  I  <  *(  WW)-  ^  die  bound  in  (27a)  then  follows 
from  (28).  This  bound  holds  for  both  odd  and  even  values 
of  q,  but  it  is  tight  only  for  q  even,  and  the  more  precise 
bound  (27b)  derived  in  the  Appendix  for  q  odd  is  the  one 
that  is  achieved  with  equality. 

Any  set  of  q  equidistant  signals  <3  satisfying  (17)  and 
achieving  the  upper  bound  (27a)  for  q  even  or  (27b)  for  q 
odd  is  a  signal  set  maximizing  Ao  Signal  sets  having 
these  properties  can  be  identified  for  certain  values  of  q 
by  the  following  procedure.  Partition  {0,  T]  into  m  equal 
subintervals,  and  define  m  functions  p/t),  «"  1,2, -  •  • ,  m, 
that  are  piecewise  constant  haying  a  constant  value  of  one 
or  zero  over  each  subinterval.  Then  p,(f)  can  be  identified 
by  a  binary  codeword  of  length  m  bits.  If  we  write 
g'W-gaM  +  P/'XgmAA-gmJ'  ,l  is  enough  to  find  q  bi¬ 
nary  codewords  of  length  m  whose  common  Hamming 
distance  satisfies  the  conditions  in  Table  I.  The  last  col- 


TABLE I 

Code  Constraints 


1 

n 

Hamming 

Distance 

m 

i-l 

even 

q-i 

A 

even 

-fq-1 ) 

q 

A 

odd 

q 

;,(q+ 1) 

odd 

q 

q+l 

jiq-1) 

umn  in  this  table  reflects  a  necessary  condition  for  opti¬ 
mality  that  follows  immediately  from  conditions  for  equal¬ 
ity  in  (Al)  that  yields  the  upper  bound  (27);  namely,  for 
all  te  [0,7], 

i)  for  q  even,  q/2  of  the  signals  take  on  value  g^  and 
the  remaining  q/2  value  gmn, 

ii)  for  q  odd,  (q—  l)/2  or  (q  +  2)/2  of  the  signals  take 
on  value  g ^  and  the  remainder  g^. 

This  provides  an  additional  check  on  the  optimality  of  the 
following  signal  set  and  was  an  important  aspect  in  our 
identification  of  it.  For  optimality,  however,  it  is  sufficient 
that  the  signal  set  be  equidistant  and  achieve  the  ap¬ 
propriate  upper  bound  (i.e.,  Hamming  distance). 

For  q  a  multiple  of  four  and  such  that  a  Hadamard 
matrix  of  order  q  exists,  q  codewords  satisfying  these 
conditions  are  easily  obtained  by  deleting  the  first  column 
(all  ones)  of  the  normalized  Hadamard  matrix  [10],  [11]. 
From  this,  s »  q  -  I  codewords  satisfying  the  third  row  of 
Table  I  with  s  replacing  q  can  be  obtained  by  deleting  the 
codeword  of  all  ones.  Also,  p<*\q  codewords  satisfying 
the  second  row  of  Table  I  with  p  replacing  q  can  be 
obtained  by  deleting  all  rows  of  the  normalized  Hada¬ 
mard  matrix  that  have  a  zero  in  (say)  the  second  column 
and  the  deleting  the  first  two  columns.  From  this,  j  — 
\q—\  codewords  satisfying  the  fourth  row  of  Table  I  with 
s  replacing  q  can  be  obtained  by  deleting  the  codeword  of 
all  ones.  Since  Hadamard  matrices  for  q~  1.2,  or  a  multi¬ 
ple  of  four  are  known  up  to  ^  —  200  (and  for  many  larger 
values),  this  procedure  gives  an  optimizing  signal  set  s  for 
all  q  <  200.  Also  several  infinite  families  of  Hadamard 
matnces  are  known,  for  example  if  2*  for  some  posi¬ 
tive  integer  k:  these  coincide  with  cyclic  maximal-length 
shift  register  codes,  and  they  are  also  a  subset  of  the 
first-order  Reed-Muller  codewords  of  this  length. 

We  remark  that  complementation  of  an  optimum  signal 
set  yields  another  optimum  signal  set.  Also  time-shanng 
of  any  two  optimum  signal  sets  yields  another  optimum 
signal  set. 

The  average  energy  of  these  signal  sets  is  easily  calcu¬ 
lated.  If  q  is  even,  at  any  time  q/2  signals  have  value  g^^ 
and  the  remainder  g^,  so 

£,-{ULx  +  sL,)7''  (29) 

which  implies 

i'“£,-«L0T'=  |(«m„-«Ln)7'.  (30) 
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Similarly  for  q  oda. 


which  implies 


{<<?-  >)g; 


(<7+  >>g; 


(31) 


(32) 

This  uses  less  energy  than  talcing  (^+l)/2  signals  with 
value  and  thus  extends  the  range  of  average  energies 
for  which  this  choice  is  optimum;  namely,  the  available 
average  energy  must  exceed  that  required  for  s  of  (30)  or 
(32).  which  yields  condition  ( 19c). 

Finally,  the  distance  d *  achieved  by  these  signals  that 
maximize  R0  x  when  the  peak-amplitude  constraint  pre¬ 
dominates  is  given  by 


-7 

8  max  8  tntn 

q-  1 

^miA  8 rrun 

<7+  1 

8  mu  &rrun 

q-  1 

8 m»Jt  &min 

q  even, 
q  odd. 


(33) 


appropriate  Hadamard  matrix,  as  discussed  in  the  previ¬ 
ous  section  with  T  replaced  by  (1  -  A )T. 

Lemma  3  is  proved  as  follows.  For  an  arbitrary  choice 
of  a  6(0, 1]  and  an  arbitrary  choice  of  maximum  average 
energy  i,  allocated  to  the  interval  (O.aT],  we 

have  from  Lemma  1 


-?(<?-•) 


■ —  r 

-l)Jn 


2  2  [ g,(r>-g,ur 

,-l y-l 


dt 


<2 


g  mix  gn 


(35) 


and  from  (27") 


—  - -T7  /  T  S  2  [g, ('»-«, <'>]'<* 
q(q~  1)  ,_i 


( I  -  a)  Tq 


2(17-1) 
(l-q)r(g+l) 
2-7 


(  g m*x  g min  )  • 


(  £max  £man)  • 


q  even.  (36a) 
q  odd.  (36b) 


Neither  Constraint  Predominates 


Adding  these  expressions  and  using  (18).  we  obtain 


If  satisfies  (19a)  or  (19c),  the  PPM  signal  set  or, 
respectively,  the  Hadamard-derived  signal  set  maximizes 
/?„.„•  Unless  <7*  2  or  <7  =  3.  we  are  left  with  a  range  of 
values  of  f ^  for  which  a  solution  has  yet  to  be  identified. 
This  “gap”  region  is  specified  in  (19b).  For  q-  2  or  <7-3, 
this  region  collapses  to  the  empty  set,  and  at  the  common 
upper  limit  of  the  range  (19a)  and  lower  limit  of  range 
(!9c),  the  PPM  or  Hadamard-denved  signal  sets  are 
equivalent  and  optimum.  For  <7  >4.  we  now  demonstrate 
that  an  optimum  signal  set  results  by  time  sharing  the 
PPM  and  Hadamard-derived  solutions. 

The  gap  region  has  strictly  positive  length  if  <7  >  4,  and 
then  any  point  in  either  interval  (19b)  can  be  expressed  as 
a  strictly  convex  combination  of  the  endpoints;  that  is,  for 
q  even  and  in  the  appropriate  interval  (19b),  there 
exists  a  unique  A£(0. 1)  such  that 

(gLt-sUT-  <34a> 


A  (1-A) 
<7  2 


d2< 


8m mx  8mm 
8  mix  <?min 

q  even, 

8  mix  8  mm 


(1  -a)Tq  _  2 

2(q—  1  )  &  min'  • 


(37a) 


8  r 


ux  g min  J 

(1  —  a)  7~(g  +  1 )  | 
2-7 


g  max  g  it 


|  <7  odd.  (37b) 

From  Lemma  1.  equality  holds  in  (35)  if  and  only  if  the 
following  pair  of  conditions  hold. 


1)  At  any  time  /GlO-aT].  all  signals  take  on  the  value 
gmm'  except  at  most  one  which  takes  on  the  value 
Sm.x-  This  implies  that  the  average  energy  i,  used  on 
[O.arjis 


1  ^  far  , 

-  1 J  g, 

-7  ,  -  1  Jo 


l‘)dt-g:mnaT  <, 


£rr 


while  for  q  odd  and  smAX  in  the  appropnate  interval  (19b), 
there  exists  a  unique  A  e  (0. 1 )  such  that 


s 


max 


A  (\-\)(q-\) 
<7  2<? 


(34b) 


“>  *l=il.m«- 

Thus  a  necessary  condition  for  equality  in  (35)  is 
aT  :  \ 

■^l.max^  q  (  ?m«x  &mm/‘ 


(38) 


An  optimum  choice  of  modulation  can  now  be  given  in 
terms  of  A. 

Lemma  3:  For  q  even  (respectively,  odd)  and  s ^  in 
the  appropriate  interval  specified  in  (19b),  let  A  be  defined 
by  (34a)  (respectively.  (34b)).  Then  an  equidistant  signal 
set  that  maximizes  /?„  „  while  satisfying  the  average-en¬ 
ergy  and  peak-amplitude  constraints  with  equality  is  for  a 
fraction  A  of  the  interval  [0,  T]  use  the  “full-width"  PPM 
signal  set  (25)  with  t  -  I  and  T  replaced  by  XT,  and  for  a 
fraction  1 -A  of  (0.  T]  use  the  signal  set  defined  by  the 


Furthermore,  the  derivation  in  the  Appendix  shows  that 
equality  holds  in  (36a)  only  if  half  of  the  signals  take  on 
the  value  gmin  and  the  remainder  gmax  This  involves  an 
average  energy  usage  i:  on  [a  T,  T]  of 

*:=-  2  f  Tgh  Odt  -  g^,,n(  1  -  a )  T 

-7  ,-r«r 

(1  -a)T,  ,  _  , 

.(max  Kmin 


2 


(39) 
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Because  of  the  total  average  energy  constraint  ij  +  s2  < 
we  have,  using  (34a). 

J'i  <•>,.,«  -  h  - 1  ^  +  ~~2~  Y  ~  ^  T  <40) 

For  equality  to  hold  in  (37a).  it  is  necessary  that  both  (35) 
and  (36a)  hold  with  equality,  and  necessary  conditions  for 
these  are  in  turn  (38)  and.  combining  i,  with  (40), 

*i.«ua<(^  +  ^y^jUmax-sL,)7'-  ‘7even-  (4!a> 


For  q  odd.  the  corresponding  necessary  conditions  for 
equality  in  (37b)  become  (38)  and 


X  t  (q-\)(4T-l) 
q  2  q 


q  odd. 
(41b) 


We  now  consider  the  selection  of  and  a  to  maximize 
the  upper  bound  (37a)  subject  to  the  constraints  (38)  and 
(41a),  which  are  necessary  conditions  for  it  to  hold  with 
equality.  Because  both  (38)  and  (41a)  are  constraints  on 
J,  mix,  we  consider  each  in  turn  to  be  dominant  in  the 
sense  of  being  more  restrictive.  The  bound  (38)  is  less  than 
or  equal  to  the  bound  (41a)  if  and  only  if  a  Substitut¬ 
ing  (38)  into  (37a)  and  simplifying,  we  obtain 

a- 


d2< 


(  8mAX  &tmn) 

2*<*-D 


[  <72  —  a(<7  -  2)j. 


Because  we  are  considering  q  >  4,  this  bound  is  maximized 
over  ae[X,  1]  by  the  unique  choice  a -A.  Similarly  the 
bound  (41a)  is  less  than  or  equal  to  the  bound  (38)  if  and 
only  if  a  <  A.  Substituting  (41a)  into  (37a)  and  simplifying, 
we  obtain 


a(q-2)  q  +  X(2  —  q) 
2(q-\)  2(?-l)  q 


Again  because  q  >  4.  this  bound  is  maximized  over  a  € 
[O.A]  by  the  unique  choice  a —  A.  Thus  the  bound  (37a)  is 
maximized,  subject  to  the  necessary  conditions  (38)  and 
(41a),  by  talcing  a -A,  and  the  corresponding  maximum 
bound  is 


d2< 


<?0-A) 

2(9-1) 


q  even.  (42) 


But  this  upper  bound  is  readily  achieved  by  the  time-shar¬ 
ing  of  a  ‘‘full  width"  PPM  signal  set  for  a  fraction  A  of 
[0.  T)  and  the  Hadamard-derived  signal  set  for  the  remain¬ 
ing  fraction  1-A  of  [0.  T],  Furthermore  the  average  en¬ 
ergy  required  by  this  solution  is  exactly  i^.  For  q  odd,  a 
similar  analysis  leads  again  to  the  unique  choice  a  “A  and 
the  corresponding  maximum  bound 


d2< 


2A  |  (1— AXq-t-l) 
q  2q 


q  odd. 


( 


PREDOMINANT  CONSTRAINT 
Fig.  2.  d1  for  optimal  signal  sets. 


Hadamard-matnx-denved  signal  set  for  the  remaining 
fraction  1  -  A  of  [0.  T ),  and  the  average  energy  required  by 
this  solution  is  exactly  smAX,  as  before. 


IV  Efficient  Energy  Utilization 


Denote  by  A,  the  count  rate  due  to  the  signal  alone 
when  it  is  “on”  for  any  of  the  optimal  signal  sets  derived 
in  the  previous  section.  Then  g^-A.-t-A,,  and 
where  A^  is  the  count  rate  due  to  the  noise  alone.  In 
considering  designs  for  efficient  energy  utilization  we  dis¬ 
tinguish  three  situations  depending  on  which  of  i^, 
and  t  are  adjustable  and  which  are  fixed.  We  seek  to 
identify  values  of  the  adjustable  parameters  so  that  the 
cutoff  rate  per  unit  energy,  R0oo/s,  is  greatest. 

1)  adjustable,  \  fixed:  The  value  of  d2  achieved 
with  the  optimal  signal  sets  of  the  previous  section  is 
shown  in  Fig.  2  as  a  function  of  smmx,  assuming  that  X,  is  a 
fixed  constant.  Here  d2  is  a  piecewise  linear  function  of 
with  the  following  parameters: 

d,2-|[l-(H-AJ/Ao),/2]2A0r,  (44) 


2^TT)[l-(l+AI/A0),/2]2A0r. 

2^Z7)[i-(,+a .A>),/2]V. 

2(A0/\)[|-(1+A1/A0)i/2]j. 

^j(V\)[l-<l+VAo)l/2]2. 

T-?  +  !(W[i-I'H/>o)1/2]‘  <?od  d. 

9-49  +  3 

(47) 

Using  (10),  with  equality  for  optimal  stgnal  sets,  and  using 
the  expressions  for  d2  implied  by  Fig.  2  and  (44)-(46),  we 
conclude  that 


q  even. 

(45) 

q  odd. 

(46) 
q  even. 


(43) 

This  bound  is  achieved  by  the  time-sharing  of  a  full  width 
PPM  signal  set  for  fraction  A  of  [0.  T ]  and  the  appropriate 


<**o.»/<tfm«”0-72(‘?- !)[(<?-  l)  +  expj|d2j  (slope). 


(48) 
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Fig.  3.  Cutoff  rate  as  a  function  of  signal-to-notse  ratio. 

where  the  factor  “slope”  is  r„  s2,  or  zero  depending  on 
which  constraint,  if  any,  predominates.  Thus  dRn  „/ ds„.. 
decreases  monotonically  with  increasing  i^,,  so  the  signal 
energy  is  used  most  efficiently  when  is  small,  where 
the  energy  constraint  predominates  and  where  the  PPM 
signal  set  is  optimal  and  utilizes  energy  i  —  im„.  This 
situation  is  analogous  to  that  studied  by  Massey  [8],  [9]  for 
the  additive  Gaussian  noise  channel.  For  s  —  im„  small, 
we  conclude  that 

«0k./i*1.44^i(V\)[  1  -d+  \/\>)W2}2'  (49) 

with  equality  for  i— 0.  Hence 

*o..  <  1-44^(V\)[  1-0  +AJ/A0),/J]Js  (50) 

is  an  upper  bound  on  RQaB  for  any  choice  of  s  and  any 
choice  of  modulation  with  near  equality  holding  when  s  is 
small  and  for  the  PPM  signal  set.  Since  i  —  iT\/ q  for  the 
PPM  signal  set,  this  means  that  when  \  is  fixed,  the  most 
efficient  energy  usage  occurs  for  narrow  pulses,  c  being 
selected  to  be  as  small  as  is  practically  feasible. 

2)  \  adjustable ,  s  fixed:  By  a  somewhat  messy  but 
straightforward  calculation  it  is  readily  verified  that 
dR^  x/d\>0  for  s  fixed.  Thus  /?0  ao,  and  hence  R0a>/s 
for  s  fixed  is  a  nondecreasing  function  of  \.  Consequently 
the  most  efficient  energy  usage  is  achieved  by  selecting  \ 
large  and  operating  in  the  region  where  the  energy  con¬ 
straint  predominates.  This  implies  using  the  PPM  signal 
set  with  as  large  a  value  of  signal  count  rate  \  as  practical 
and  sufficiently  narrow  pulses  that  s  —  tT\/q. 

3)  I  adjustable,  PPM  signal  set  with  «  fixed:  The  PPM 
signal  set  with  a  fixed  pulsewidth  tT/q  maximizes  R^^ 
provided  the  energy  constraint  predominates,  which  we 
assume.  For  t  fixed  and  i- tT\/q,  we  find  that 

Re.  «(•*•*.«)  “  1°82  <1  ~  l°82  ( 1  +(?-!) 


where  we  define 


to  be  the  “effective”  average  number  of  noise  counts  per 
channel  use  and  where 

“eff-V'trff  (52) 

is  the  signal-to-noise  energy  ratio.  Graphs  of  R0  „(s,  ne„) 
as  a  function  of  arf(  for  several  values  of  are  given  in 
Fig.  3.  These  graphs  are  seen  to  increase  monotonically 
with  aM  for  each  fixed  value  of  nM.  Thus  as  expected  the 
performance  improves  systematically  for  fixed  fiM  as  the 
average  signal  energy  per  channel  use  s  increases.  How¬ 
ever,  although  starting  from  i—0  the  performance  initially 
improves  rapidly,  there  is  a  point  of  diminishing  returns 
after  which  there  is  only  marginal  improvement  for  fur¬ 
ther  increases  in  i.  For  each  nrff,  there  is  an  i—J*(/Feff) 
such  that  for  all  s  >  0  there  holds 

*0,  ^  Rq.Js^c)  (53) 

s  s* 

This  value  of  i  can  be  found  graphically  for  each  nM  by 
pivotting  a  vertical  line  about  the  origin  (/t0.«>  “0,  a^r  ”  0) 
in  Fig.  3  until  it  lies  tangent  to  the  graph  of  R0  ^(s,  nM). 
The  abscissa  of  the  point  of  tangency  is  Inequality 

(53)  holds  because  the  graph  of  ^O.aoC**rt«fr)  lies  on  or 
below  the  line  so  constructed  for  all  aeff>0.  It  follows 
from  (53)  that  the  most  efficient  usage  of  energy,  in  the 
sense  that  the  cutoff  rate  per  unit  energy  is  greatest,  is 
achieved  when  i«i*.  The  dashed  line  in  Fig.  3  is  a  fit  of 
s* /hM  versus  obtained  graphically  by  connecting 
together  the  points  of  tangency  described  above.  From 
this  fit  we  find  for  the  range  of  average  noise  counts  in  the 
figure  that  s*  and  are  approximately  related  by  the 
following  power  law: 

fas 2.349^52.  (54) 

This  is  shown  as  the  solid  line  in  Fig.  4.  A  measure  of  the 
range  of  energies  nearly  as  efficient  as  s*  can  be  de¬ 
termined  for  each  from  the  values  of  arf(—  s/hM  in 
Fig.  3  for  which  R <j  »>(•*>  "eii)  is  close  to,  say  within  ten 
percent  of,  the  ordinate  of  the  line  of  tangency  con¬ 
structed  as  above.  Values  of  s  within  the  dashed  lines  in 
Fig.  4  satisfy  this  ten  percent  condition;  Fig.  4  implies 
that  for  maximally  efficient  energy  utilization  s  should  be 
kept  within  about  ±  2  dB  of  i*. 
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V.  Effect  of  Finite  Output  Quantization 

The  cutoff  rate  decreases  from  R0 as  the  dimension  q' 
of  the  output  alphabet  decreases.  This  degradation  is 
greatest  for  a  binary  input  alphabet  (<?  — 2)  when  q'm2, 
which  corresponds  to  making  bit  by  bit  decisions  without 
any  coding.  For  a  Gaussian  model  Massey  [8],  [9]  con¬ 
cludes  that  choosing  q' »  2  results  in  a  quantization  loss  of 
2.04  dB;  that  is,  in  the  efficient  range  of  energy  utilization 
for  the  Gaussian  model,  the  energy  per  channel  use  must 
be  about  2  dB  greater  for  q '">2  in  order  to  achieve  the 
same  cutoff  rate  as  when  q'  —  oo.  Massey  also  concludes 
that  for  q'~8  there  is  virtually  no  quantization  loss.  The 
degradation  for  the  Poisson  model  is  somewhat  smaller 
than  that  found  by  Massey  when  5  <  hcfl  <  40  and  is  of 
about  the  same  order  when  —  I. 

Suppose  the  input  and  output  alphabets  are  9C  «■  {0.1} 
and  ^  «•  {0, 1 },  so  that  q<~q'~2.  We  adopt  a  binary 
pulse-position  modulation  format  with  pulses  of  duration 
tT/ 2  because  this  maximizes  R0  x  when  the  energy  con¬ 
straint  predominates.  For  this  choice  each  symbol  interval 
is  divided  into  two  equal  subintervals  and  output  letters 
are  generated  according  to  “produce  one  if  n[Q,tT/2]< 
n[T/2,(\  +  i)T/2],  otherwise  produce  zero,”  where 
n(0,«r/2]  and  n[T / 2,(1  +  t)T /2]  are  the  number  of  points 
observed  in  the  first  and  second  signalling  interval,  respec¬ 
tively.  Here  n[0,eT/2]  and  n[T/2,(i  +  ()T/2)  are  indepen¬ 
dent  Poisson  random  variables  with  mean  parameters 
i  +  (nrff/2)  and  nM/ 2,  respectively,  when  zero  is  the  input 
letter  and  nrft/2  and  j+(nef(/2),  respectively,  when  one  is 
the  input  letter.  As  in  the  previous  discussion  i  and  are 
the  average  number  of  signal  counts  received  per  channel 
use  and  effective  number  of  noise  counts  received  per 
channel  use.  It  is  straightforward  to  conclude  for  these 
assumptions  that  the  cutoff  rate  is  given  by 

«o.,-2-l-l°g2{>+2[p(l-p)],/:!}.  (55) 

where  p  is  the  binary  error  probability  associated  with 
producing  an  output  symbol  1  (or  a  0)  when  the  input 
symbol  is  a  zero  (or  a  one,  respectively).  This  error  proba¬ 
bility  is  given  graphically  for  certain  values  of  n^t  and  a 
range  of  i  by  Pratt  [4,  p.  209:  identify  h^,  -  2pHB  and 

Ps.aJ- 

The  values  tabulated  in  Table  II  were  obtained  as 
follows:  1)  i*  is  obtained  from  (54)  for  each  n^,;  2)  p*  is 
the  value  of  p  in  (55)  such  that  0<p*  <  1  and  ^o,»-2“‘ 


TABLE  II 

Degradation  due  to  Finite  Quantization 


"•If 

9 

R* 

V- 

p* 

s 

101ogl»/»#1 

i 

2.  15 

0.33 

0.038 

3.3 

2.09 

5 

4.86 

0.65 

0.020 

?.o 

1 .  ;8 

10 

6.  &5 

0.68 

0.  016 

*.25 

1  .  -3 

20 

«.  10 

0.  70 

0.014 

1 2.7 

l  ^ 

40 

12.45 

0.71 

0.011 

L6.9 

l.  33 

Rq  ^,;  and  3)  s  is  obtained  by  interpolation  from  the  graph 
given  by  Pratt.  Thus  s*  and  s  of  the  table  yield  the  same 
cutoff  rate  for  <7  —00  and  <?' —  2,  respectively.  To  within 
the  accuracy  that  the  interpolation  step  can  be  accom¬ 
plished,  we  conclude  that  about  1.5  dB  more  signal  energy 
is  required  with  hard  decisions  than  with  infinitely  soft 
decisions  for  h -f  in  the  range  of  five  to  40  counts  per 
channel  use. 

VI.  Effect  of  Input  Alphabet  Dimension 

For  an  input  alphabet  of  dimension  q,q'  —  00,  and  an 
average  energy  constraint  that  predominates,  qary  pulse- 
position  modulation  maximizes  the  cutoff  rate.  We  now 
consider  the  effect  of  q  in  each  of  the  three  situations 
identified  in  Section  IV. 

1)  smMMt  adjustable ,  X,  fixed:  From  (48)  and  (49),  increas¬ 
ing  q  from  two  to  infinity  implies  that  the  greatest  rate  per 
unit  energy  that  can  be  achieved  increases  by  a  factor  of 
two.  Moreover,  examination  of  the  graphs  of  R^^/s  for 
Rq.x  fPven  by  (10)  with  equality  and  with  d2  —  sis,  where 
j,  is  given  in  (46),  shows  that  the  range  of  values  of  s  for 
which  the  approximation  (49)  holds  closely  increases  as  q 
increases;  in  other  words,  the  range  of  efficient  signal 
energies  is  extended  as  q  is  increased. 

2)  X,  adjustable,  s  fixed:  For  \  large  and  the  PPM 
signal  set,  we  see  from  Fig.  2  and  (46)  that  d2^2s.  Then 

*o.«;tolog2?-lo82[I+(‘7-|)e"i]  <  <56) 

Hence,  for  large  X,,  R^ ^/si^iq  - \)/ q  and  therefore  the 
largest  rate  per  unit  energy  increases  by  no  more  than  a 
factor  of  two  as  q  increases  from  two  to  infinity. 

3)  s  adjustable ,  PPM  signal  set  with  e  fixed:  A  graph  of 
(51)  as  a  function  aM  for  hcll  —  16  and  various  values  of  q 
is  shown  in  Fig.  5.  For  each  q,  there  is  a  corresponding 


Fig.  5  Effect  of  input-alphabet  dimension  00  cutoff  rate 
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signal  energy  that  is  most  efficient;  this  can  be  found 
graphically  in  the  same  manner  as  before,  as  indicated  by 
the  lines  of  tangency.  These  efficient  energies  depend 
upon  q\  very  roughly  we  find  from  the  graphs  that 
(s*/l6)qasl,  so  that  the  most  efficient  signal  energy  de¬ 
creases  as  q  increases.  This  implies  a  significant  potential 
improvement  in  performance  at  low  signal  energies  by 
using  a  large  input-alphabet  dimension  q  and  qaxy  pulse- 
position  modulation.  These  observations  appear  to  hold 
for  other  values  of  as  well. 

VII.  Effect  of  Random  Detector  Gain 

Let  { M(t );  /  >  0}  be  a  compound  Poisson  counting 
process  defined  by 

N(') 

M(')-  2  (57) 

n  ■»  0 

where  { iV(r);  t>  0}  is  the  Poisson  counting  process  de¬ 
fined  above,  uo*»0,  and  ( u„ ;  n  —  1,2,-  •  • }  is  a  sequence  of 
independent  identically  distributed  random  variables  each 
having  an  integer  value  greater  than  zero.  Here  [N(t)\ 
t  >  0)  models  primary  photoelectron  conversions,  and  u„ 
models  the  number  of  secondary  electrons  appearing  at 
the  detector  output  due  to  the  nth  conversion.  This  ran¬ 
dom  gain  is  an  important  effect  encountered,  for  example, 
with  avalanche  detectors  used  in  optical-fiber  communica¬ 
tion  systems. 

In  considering  a  digital -data  communication  system  in 
which  measurements  are  derived  from  { M(t );  /  >0},  it  is 
of  interest  to  known  the  cutoff  rate  R^^  for  infinitely  fine 
quantization.  As  before,  this  quantity  then  places  an  up¬ 
per  limit  on  the  performance  of  any  receiver  employing 
finite  quantization,  such  as  an  “integrate  dump”  receiver 
[12],  [13]  in  which  M(nT)—  M\(n  —  1)T]  is  used  to  make  a 
decision  about  the  nth  transmitted  symbol. 

We  find  M  to  be  identical  to  that  in  (6),  so  random 
detector-gain  neither  degrades  nor  enhances  the  cutoff 
rate  for  infinitely  fine  output  quantization.  This  is  because 
the  distribution  of  the  random  gains  is  unaffected  by  the 
choice  of  transmitted  signal  on  our  model  and  can  be 
verified  mathematically  by  the  following  steps.  First  we 
write  the  summation  over  k  in  (3)  as /(/J)- £;[A‘j2(>')], 
where 

\J(Y)“P,lAY\X,)/P*Ay\Xj)  (58) 

is  the  likelihood  ratio  for  symbol  X,  relative  to  symbol  Xj 
and  Ej(  )  denotes  a  conditional  expectation  given  Xy  As 
the  output  quantization  is  refined,  this  becomes 

AiJ)- £,[A‘jJ(A/(0;  0  <  /  <  T)],  (59) 

where  A tJ(M(ty,  0 <t<T)  is  given  by  the  ratio  of  the 
sample  function  densities  [18]  of  (A/(r);  0  <  /  <  7"}  for 
symbols  X,  and  Xj.  This  likelihood  ratio  is  found  not  to  be 
a  function  of  the  random  gains,  and  the  assertion  follows. 

A  consequence  of  this  assertion  is  that  many  of  the 
conclusions  reached  in  the  preceding  sections  also  apply 


in  the  presence  of  random  detector- gain.  At  the  present 
time  there  are  too  few  published  results  on  the  binary 
error-probability  for  an  integrate-and-dump  receiver  for 
us  to  examine  the  potential  benefits  of  employing  finer 
output  quantization,  but  this  is  a  matter  of  some  practical 
interest  for  fiber-optic  systems. 

VIII.  Polarization  Modulation 

Suppose  that  binary  orthogonal  polarization  modula¬ 
tion  can  also  be  employed  in  the  optical  modulator  of  Fig. 
I  in  addition  to  temporal  modulation.  Then  the  scalar 
field  £(f,r)  becomes  a  vector  (£,(/, r),£2(f,r))  in  which 
one  component  is  the  0°  field  and  the  other  one  the  90° 
field.  A  polarization  decomposition  of  the  received  field 
followed  by  direct  detection  in  each  channel  then  results 
in  two  independent  point  processes,  which  we  label  V,(r) 
and  V2(f),  0 <t<T.  Assume  that  when  the  input  code 
letter  is  XiG.{Xl,X2,-  ■  ■  ,Xq]  that  the  count  rate  for  i V,(r) 
is 

A|,(/)-r,  jO  +  K-slO).  (60a) 

and  for  W2(r)  is 

\u(0"*2i(0  +  \>“$L(f)-  (60b) 

Following  the  procedure  used  in  the  last  section,  as  q'— * 
oo,  the  sum  over  k  in  (3),  call  it  becomes 

AiJ) -  Ej { a,^2[  V,(/), N2(  t);  0  <  /  <  T] } 

-exp(-|d2j,  (61) 

where 

<^“/r([8i>(,)~*iy(,)]2+[^(,)''^(f)]2)^-  (62) 
Jo 

The  steps  leading  to  (10)  remain  unchanged  with  (62) 
replacing  (8). 

We  now  assume  that  each  of  the  signals  in 
S,  -{£,,(/.  r),  £,2(/,  r),  £,,(<,  0}  and  S2  « 

<£21(r,r), £*(!,;),•••,  £^(r,  r)}  satisfy  the  average-energy 
and  peak-amplitude  constraints  in  the  section  about  mod¬ 
ulator  design  based  on  Ro  ^-  Then  we  have  the  following 
optimization  problem;  select  signals  in  =* 

{  *u(0.  fi2(0»  •  •  •  .$1,(0}  and  32-{g2l(r), 
822(0.-  ■  •  .92,(0}  to  maximize 

d2^[q(q~  l)]'1  2  2  /  ([  $,i(0 ~gv(0]2 

+  [$2,(')-$2,(0]V  (63) 

subject  to  the  following  constraints. 

i)  Equidistance  constraint:  The  quantities  in  (62) 
should  be  the  same  whenever  1 
li)  Average-energy  constraint:  (15)  should  be  satisfied 
for  both  signal  sets  and  32. 
iii)  Peak-amplitude  constraint:  (17)  should  be  satisfied 
for  both  signal  sets  (3,  and  9’2. 
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By  paralleling  the  development  leading  to  Lemma  1,  we 
have  the  following. 

Lemma  /'.  Let  rm„  be  the  maximum  average  signal 
counts  per  channel  use  in  each  polarization  component, 
and  suppose  (17)  applies  to  both  signal  sets  <3,  and  (?2- 

Then 

rfa<4[  (64) 

Furthermore,  equality  holds  if  and  only  if  both  a)  at  any 
time  ie[0,  7"],  all  signals  in  9,  and  §2  talce  on  t^le  value 
g_,_  except  at  most  one  in  i?,  and  one  in  that  takes  on 
the  value  and  b) 

7  2  fTgL(‘)dt-g1miaT~sm,  for  k  —  1,2. 

4  ,-i  Jo 

A  signal  set  t?  -  is,  u  (?2  11141  1S  equidistant  in  the  sense 
that  the  quantities  in  (62)  are  the  same  whenever  i+j, 
achieves  the  upper  bound  in  Lemma  1'  with  equality,  and 
which  therefore  maximizes  x  when  the  average-energy 
constraint  predominates  in  each  polarization  component, 
is  characterized  in  the  following  lemma  for  q  even. 

Lemma  2':  If  q  is  even  and  s _ satisfies  (19a),  equality 

is  achieved  in  (64)  by  the  following  signal  set.  For  I  <i  < 
(q/2)  andj-i  +  (<7/2), 

gm.  (i~  \)2T/q<t<(i-l+t)2T/q, 

Smrn-  otherwise  for  0  <  t  <  T, 

*r,(')  “£*(') -Smu,.  0  </<r, 

where  t  is  given  in  (26). 

The  verification  of  Lemma  2'  parallels  that  of  Lemma 
2.  It  is  interesting  to  note  that  for  q  —  4  this  signal  set,  then 
called  “quaternary  pulse  modulation,”  is  used  in  the  one 
gigabit  per  second  optical  communication  system  reported 
by  M.  Ross  et  al.  [15]. 
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Appendix 
Derivation  of  (27) 

Let  2  “(1,2,-  ■  •  ,q )  and  define  J *  by 

/•-  max  (T  2  [*i(')-?/(0]2<tf-  (Al) 

*,(•).*  ei  A,  ,J g2L  ‘ 


Then 

/•<T  max  max  2  [*.('>  “ftf')]2’  (A2) 

<e(o.r]  ,^£.1 L  J 

with  equality  if  and  only  if  the  integrand  in  (Al)  is  a  constant 
independent  of  i.  Thus  we  consider  the  problem  of  choosing  q 
real  numbers  gk,  k  €  2  to  maximize 

1(g)-  2  (g-g,)1  (A3) 

subject  to  <  &  <  g-..-  i  €2  .  A  necessary  condition  for  &*,< 
E  2  to  minimise  -  /  (and  so  maximize  /)  is  the  existence  of  2 q 
real  numbers  r,  >0,  n,  >0,  i—  1.2,-  •  •  ,q,  such  that  (16]*: 

L(g*./*,g)  <L(g,/LE),  for  all  f  in  R1,  (A4) 

g,”  <gmMX  implies  it,  -0,  (A5) 

&*  >gmUi  implies  e,  -  0,  (A6) 

where  the  Lagrangian  L  is  defined  by 

L(g,H,v)--  2  (ft~*y)2+  2 

tjea  iei 

+  2  y,(gaun-g,)-  (A7) 

Setting  the  derivative  of  L  with  respect  to  g,  equal  to  zero,  we 
obtain  for  i  E  2 

-2<7&*  +  2<tc*  + /i, (A8) 

where  c‘  —  q~  '2l6la  From  (A5)  and  (A6),  if  g _ <g*  <g _ 

then  ji,  ”  “0.  and 

(A9) 

Thus  each  g,*  takes  on  one  of  three  values:  gM,  X _ or  c*.  Let 

there  be  n _ and  Q-n _ -n_..  of  these,  respectively. 

From  the  definition  of  c*  we  have 

< F*  ■"«.!».  +  »«.»■«  +  (?*"«.  ~  Oc* 
or 

c*-{nma,gwum  +  nmMtgmMX)/(naat+  n^^).  (AI0) 

Furthermore, 

2  (a*-S/)2“2'W,^(Sm«-S:mJ2 

ijeH 

+  2 nM(q  -  -  /!,«,)(  -  c* )J 

+  2nmui(q  -  -  n„„)(c*  -  go**)2 

(All) 

Substituting  (A  10)  into  (All)  and  simplifying,  we  obtain 

/( g *)  ■ -  L(  f*.  it,  r)  -  2*(  u  -  gmm)2(  J-  +  -L  ) 

V  "an  nm»»  / 

(A12) 

and  this  is  to  be  minimized  subject  to  0  +  n _ 

and  n_„  being  nonnegative  integers,  which  is  the  same  as  the 
minimization  of  (l/n..,.)-Hl/n— .)  with  the  same  constraints. 
The  solution  to  this  is  for  q  even,  nmiB“,nM,“<f/2;  and  for  q 

'This  is  t  necessary  condition  for  a  regular  point  $' *  to  minimm  -  / 
subject  to  **»<*,<*«»„  «62.  Since  and  <0 

cannot  be  simultaneously  active  (that  is,  satisfied  with  equality),  it  is 
evident  that  the  set  of  gradient  vectors  of  the  active  conatrsunu  can 
include  a,  (the  ith  natural  basis  vector)  or  -  *,  but  not  both  (and  poaaibly 
neither).  Thus  the  set  of  gradient  vectors  of  the  active  constraints  is 
linearly  independent  for  any  g,  and  any  g  is  therefore  regular. 
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odd,  either  n_.  »(<;-»•  13/2  and  (fl- l)/2  or  - 

(q-  l)/2  and  ~(a  +  l)/2.  Thus2  for  q  even  we  have 

(AI3) 

and  for  4  odd, 

/(*•)- ^<9  +  I)(?-1K/«a-?«uJ2  (A14) 

The  corresponding  upper  bounds  on  d*1  —  q  ~  '(<?  —  iy  *  are, 
from  (AI), 


y(*+D«_,(f«,-f-Jar. 


4  even, 

(A15) 

9  odd. 
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ABSTRACT 

Several  decentralized  algorithms  for  determin¬ 
ing  the  shortest  paths  in  a  network  are  presented. 
Both  static  and  dynamic  networks  are  discussed. 
Each  algorithm  has  localized  information  and  commu¬ 
nication  requirements,  operates  asynchronously,  and 
converges  to  the  optimal  solution  in  finite  time. 

INTRODUCTION 

This  paper  concerns  shortest  path  algorithms 
that  enable  each  node  in  a  network  to  calculate  its 
shortest  distance  to  any  other  node  using  only  local 
knowledge  of  the  network  topology  and  only  local 
information  transfer  between  adjacent  nodes.  Such 
algorithms  are  of  obvious  importance  in  many  appli¬ 
cations  . 

Our  initial  algorithm  [1]  was  developed  for  a 
static  network  in  which  branch  lengths  and  topology 
remain  constant,  though  it  can  accommodate  limited 
changes.  We  discuss  here  a  number  of  modifications 
to  the  algorithm  that  enable  it  to  operate  in  a 
dynamic  network  in  which  branch  lengths  can  increase 
or  decrease,  and  nodes  or  branches  can  be  added  to 
or  removed  from  the  network.  The  ability  of  an 
algorithm  to  handle  such  topological  changes  is 
essential  in  most  practical  applications. 

THE  STATIC  ALGORITHM 

We  present  a  brief  outline  of  the  algorithm 
described  in  [1].  Consider  a  directed  graph  with 
branch  lengths  unrestricted  in  sign,  but  the  sum  of 
the  lengths  in  any  closed  loop  of  the  network  is  as¬ 
sumed  to  be  positive.  For  convenience,  we  assume 
that  the  existence  of  a  link  from  one  node  to  an¬ 
other  implies  the  existence  of  the  opposite  link. 
Very  little  topological  information  is  needed.  Each 
node  needs  to  know  only  which  nodes  are  its  neigh¬ 
bors,  the  length  of  the  branch  to  each  of  its  neigh¬ 
bors,  and  distance  information  received  from  these 
neighbors.  For  each  ultimate  destination,  a  node 
calculates  and  stores  an  assessment  of  the  shortest 
distance  via  each  of  its  neighbors,  the  smallest  of 
these  is  taken  to  be  its  assessment  of  the  distance 
to  that  destination.  Also  stored  is  the  identity  of 
the  neighbor  that  achieves  this  minimal  distance. 

Whenever  a  node ’ s  current  shortest  distance  to 
a  destination  changes,  either  through  reinitializa¬ 
tion  or  as  a  result  of  new  information  received  from 
a  neighbor,  this  new  distance  is  transmitted  to  all 
neighbors.  At  the  conclusion  of  the  algorithm,  each 


node  will  know  the  shortest  distance  to  each  other 
node  (or  that  no  path  exists),  the  next  node  in  the 
path  that  achieves  this  distance,  and  the  shortest 
distance  via  each  alternative  neighbor.  The  algor¬ 
ithm  is  guaranteed  to  converge,  even  if  it  is  imple¬ 
mented  in  an  asynchronous  manner. 

DYNAMIC  NETWORK  ALGORITHMS 

While  the  above  algorithm  handles  static  net¬ 
works,  many  problems  arise  for  a  dynamic  network 
model.  These  phenomena  include  branch  lengths 
decreasing  and  increasing,  branches  being  introduced 
into  the  network,  and  branches  failing  or  being 
removed  from  the  network.  The  static  algorithm  can 
guarantee  convergence  for  branch  length  decreases, 
but  not  increases  or  branch  failures. 

For  this  reason,  we  have  developed  several 
modified  versions  of  the  static  algorithm,  two  of 
which  are  described  in  this  paper.  Each  of  these 
modified  algorithms  is  based  primarily  on  some  form 
of  reinitialization  of  the  basic  static  algorithm; 
they  differ  mainly  in  the  mechanics  of  the  reini¬ 
tialization.  Both  share  the  convergence  properties 
of  the  static  algorithm,  provided  that  the  topology 
of  the  network  remains  constant  long  enough  for  the 
algorithms  to-  find  the  new  solution. 

Local  Reinitialization 

It  is  not  sufficient  to  merely  disseminate  a 
reinitialization  command  throughout  the  network  when 
a  branch  length  increase  or  branch  failure  occurs. 
Once  a  node  has  reinitialized,  it  needs  some  guaran¬ 
tee  that  subsequently-received  information  is  based 
on  all  affected  nodes  having  reinitialized  also. 
Thus,  a  mechanism  has  been  devised  for  each  node  to 
determine  which  distance  information  it  receives  is 
trustworthy  (in  that  all  nodes  further  down  the 
corresponding  path  are  guaranteed  to  have  reiniti¬ 
alized)  and  which  is  not.  In  simple  terms,  on 
hearing  that  a  branch  length  increase  or  failure  has 
taken  place,  a  node  ignores  distance  information 
sent  by  any  questionable  neighbor  until  that  neigh¬ 
bor  acknowledges  that  it,  too,  is  aware  of  the 
change.  As  each  neighbor  in  turn  so  acknowledges, 
the  embargo  on  its  information  is  removed.  In  this 
way,  some  convergence  toward  the  new  solution  can  be 
taking  place  while  news  of  the  change  is  still 
propagating  through  the  network. 

More  precisely,  this  algorithm  involves  the  use 
of  a  "special  action",  which  is  initiated  by  a  node 
detecting  a  branch  increase  or  failure.  When  this 


occurs,  Che  initiator  assigns  a  unique  index  to  the 
special  action  (consisting  of  his  node  cumber  and  a 
counter),  and  does  the  following: 

1.  Reinitializes 

2.  Places  an  embargo ,  indexed  by  the  special 

action,  on  distances  received  from  every 

neighbor. 

3.  Transmits  all  shortest  distances  to  each  neigh¬ 
bor,  along  with  the  special  action  index. 

Each  neighbor  receiving  this  information  Cakes 
analogous  action,  in  Step  2  placing  an  embargo  on 
all  his  neighbors  except  for  the  one  he  has  just 
heard  from,  and  in  Step  3  using  the  same  special 
action  index.  He  then  waits  until  a  message  is 
received  from  a  neighbor,  at  which  point  his  action 
is  governed  by  the  following: 

Case  1.  Message  contains  no  special  action  index. 

A.  If  there  is  no  embargo  on  this  node, 

calculate  the  new  distances  via  this 
neighbor  and  proceed  normally. 

B.  Else,  ignore  the  message. 

Case  2.  Message  contains  a  special  action  index. 

A.  If  there  is  no  embargo  on  this  node  with 
the  same  index  as  in  the  message,  perform 
steps  1-3  above. 

B.  Else,  remove  the  matching  ban  from  this 

particular  neighbor.  If  no  other  embargo 
exists,  calculate  new  distance  via  this 

node.  Otherwise,  ignore  distance  compo¬ 

nent  of  message. 

This  algorithm  can  effectively  handle  increas¬ 
ing  and  decreasing  branch  lengths  and  failures,  but 
some  other  technique  must  be  introduced  if  one 
wishes  to  add  nodes  or  links  to  the  network.  One 
such  technique  is  described  here. 

Global  Reinitialization  Procedure.  This  pro¬ 
cedure  brings  about  global  reinitialization  by 
effectively  suspending  all  communication  of  distance 
information  for  a  sufficiently  long  period  of  time 
to  insure  that  all  nodes  have  reinitialized.  Sever¬ 
al  possible  mechanisms  for  achieving  this  present 
themselves:  one  is  for  one  of  the  nodes  incident  to 
the  new  link  to  decide  upon  a  future  time  at  which 
cossminication  of  distance  information  based  on 
reinitialization  will  resume,  and  to  send  this  to 
his  neighbors  who  continue  to  propagate  it  through¬ 
out  the  network.  Implicit  here  is  the  existence  of 
a  time  base  common  to  all  nodes,  and  the  availabili¬ 
ty  to  each  node  of  (at  least  an  upper  bound  on)  the 
time  it  takes  for  the  "reinitialization  message”  he 
initiates  to  propagate  throughout  the  network,  which 
implies  a  more  global  knowledge  of  the  network. 

The  resulting  algorithm  uses  primarily  local¬ 
ized  reinitialization,  with  global  reinitialization 
being  used,  hopefully  infrequently,  only  to  allow 
new  links  (nodes)  to  be  added  to  the  network.  The 
algorithm  can  accommodate  an  arbitrary  number  of 
topological  changes,  requiring  only  an  eventual 
cessation  of  topological  changes  in  order  to  con¬ 
verge  to  the  optimal  solution. 


Local  Reinitialization  with  Acknowledi 


This  version  of  the  algorithm  is  a  modification 
of  the  previous  one,  developed  to  solve  the  problem 


of  adding  ’.inks.  If  a  node  remembers  the  indices  of 
special  actions  after  it  processes  them,  it  can 
insure  that  any  new  neighbors  that  it  acquires  are 
informed  of  the  most  recent  special  actions.  But 
how  long  nust  a  node  remember  which  actions  have 
occurred?  In  a  network  in  which  branch  increases 
are  a  common  occurrence,  it  would  be  infeasible  for 
a  node  to  remember  every  special  action  that  it  had 
received.  Actually,  a  node  need  remember  a  particu¬ 
lar  special  action  only  until  it  can  be  certain  that 
every  node  in  the  network  has  knowledge  of  it. 

We  therefore  introduce  an  acknowledgment  sys¬ 
tem.  For  each  special  action  generated  in  the 
network,  we  create  a  tree,  rooted  at  the  initiator 
node,  to  which  nodes  are  added  as  they  are  first 
informed  of  the  special  action.  When  the  tree  is 
complete,  acknowledgments  are  sent,  via  the  tree,  to 
the  initiator  node,  who  then  sends  a  message  to 
erase  all  memory  of  this  special  action. 

Unfortunately,  this  modification  introduces  a 
new  difficulty;  failure  of  a  node  or  link  of  a  tree 
can  disrupt  the  flow  of  acknowledgments,  necessitat¬ 
ing  some  type  of  emergency  action.  As  in  the  previ¬ 
ous  algorithm,  the  Global  Reinitialization  Procedure 
is  one  possible  solution  to  this  problem,  the  advan¬ 
tage  here  being  that  only  certain  failures  necessi¬ 
tate  this  emergency  action,  whereas  all  new  links 
required  this  action  previously. 

CONCLUSION 

We  have  developed  a  static  algorithm  and  two 
dynamic  algorithms  that  require  only  local  informa¬ 
tion  transfer  and  local  topological  knowledge. 
These  algorithms  have  finite-time  asynchronous 
convergence  properties.  Each  has  certain  advantages 
and  disadvantages,  and  the  applicability  and  suita¬ 
bility  of  each  is  a  function  of  the  characteristics 
of  the  particular  network  under  consideration. 
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Abstract .  Several  inter- related  quantitative  measures  of  controllability, 
observability,  reachability,  and  reconstructibility  are  introduced,  their 
properties  discussed,  and  some  of  the  implications  indicated.  The  measures 
vary  continuously  with  the  system  parameters  and  the  system  state,  and  they 
reflect  appropriate  duality  relationships.  Two  areas  of  application  are 
considered:  one  is  in  quantifying  the  degree  of  interaction  between  system 
input  and  output,  and  the  other  is  in  providing  simple,  design-oriented 
upper  and  lower  bounds  on  estimation  performance. 

Keywords .  Controllability;  observability;  linear  systems;  control  theory; 
system  theory;  filtering;  Kalman  filter;  interaction;  performance  bounds. 


INTRODUCTION 

The  large  and  highly-developed  body  of 
Knowledge  concerning  the  structural  proper¬ 
ties  of  linear  systems  is  framed  almost 
entirely  in  terms  of  "yes"  and  "no"  ques¬ 
tions  and  answers;  a  state  is  either  reach¬ 
able  or  it  is  not,  an  input  disturbance  is 
either  localized  away  from  an  output  or  it 
is  not,  a  system  is  decoupled  or  it  is  not; 
in  each  case,  available  characterizations 
afford  conditions  that  can  be  checked  to 
determine  which  of  the  two  holds  true  or 
can  be  made  to  hold  true  with,  for  example, 
suitable  state  feedback.  Almost  all  of 
these  involve,  directly  or  indirectly,  the 
controllability  and  observability  proper¬ 
ties  of  the  system.  There  is,  however,  no 
body  of  knowledge  relating  to  the  approxi¬ 
mate  achievement  of  these  goals,  or,  more 
generally,  of  the  degree  to  which  they  are 
achieved.  For  many  practical  purposes  it 
is  sufficient  if,  for  example,  an  input  has 
an  acceptably  small  influence  on  an  output, 
and  it  is  not  necessary  for  this  influence 
to  be  zero.  Especially  in  large  systems, 
some  measure  of  the  degree  of  interaction 
or  noninteraction  between  subsystems  seems 
essential  for  analyzing  the  system,  for 
designing  estimation  or  compensation 
schemes,  and  for  assessing  the  performance 
of  these  estimators  and  controllers  (in 
terms,  say,  of  performance  bounds). 
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We  introduce  here  several  inter-related 
measures  of  reachability  and  observability 
(also  controllability  and  reconstructi¬ 
bility)  that  are  quantitative  in  nature  and 
depend  continuously  on  the  system  parame¬ 
ters  and  the  state  concerned.  These  mea¬ 
sures  exhibit  appropriate  duality  relation¬ 
ships.  We  also  illustrate  the  applications 
of  these  in  two  areas:  The  first  is  in 
quantifying  the  degree  of  interaction 
between  system  input  and  output,  and  the 
second  is  in  providing  upper  and  lower 
bounds  on  estimation  performance. 

For  simplicity  of  presentation  we  concen¬ 
trate  here  on  constant,  continuous-time 
systems.  Entirely  analogous  results  apply 
for  discrete-time  systems,  and  the  exten¬ 
sion  to  time-varying  systems  is  straight¬ 
forward.  Extension  can  also  be  made  to 
infinite-dimensional  systems  such  as  those 
satisfying  partial  differential  equations 
or  delay-differential  equations. 


■MEASURES  OF  REACHABILITY, 
CONTROLLABILITY,  OBSERVABILITY, 
AND  RECONSTRUCTIBILITY 

Consider  the  continuous-time  linear  system 


x(t) 

=  Ax(t)  +  Bu(t) 

(la) 

y(t) 

=  Cx(t ) 

(lb) 

with  x(t)eRD,  u(t)£R^,  and  y(t)eRl”. 

One  natural  way  to  measure  the  reachability 
of  a  given  state  x  at  time  T  is  as  the 
maximum  value  of  the  inner  product  z'x  over 
all  states  z  to  which  the  system  ila)  can 
be  driven  at  time  T  (from  the  'rigin  at 


Ian  B.  Rhodes 


time  zero)  with  at  most  1  'imt  of  input 
energy ,  1 . e . , 

r(xj  =  max  [x'z:  z  -  j"  "'Buitjdt, 

I  0 

fX.  1  | 

j  u* (t)u(t)dt  <  I 

_  0  J  \ 

Performing  this  straightforward  maximiza¬ 
tion  we  find 

r(x)  =  (x'V^O.Tjx)*1  (2) 

where  W_(0,T)  is  the  reachability  Gramian 
(Brocket,  1970) 

W  (0,T)  =  / eA(T*t)BB,eA' (T'C)dt  (3) 
R  0 

If  x  has  norm  1,  r(x)  reduces  to  the  pro¬ 
jection  along  x  of  all  states  of  (la)  that 
are  reachable  with  at  most  one  unit  of  in¬ 
put  energy. 

Iff  r(x)  =  0  then  x  is  unreachable  in  the 
standard  sense  of  the  term,  i.e.,  orthogon¬ 
al  to  the  set  of  reachable  states,  so  the 
inner  product  x'z  is  identically  zero; 
equivalently,  x  is  in  the  nullspace  of 
WR(0,T).  Large  values  of  r(x)  correspond 
in  some  sense  to  states  that  are  easily 
reached,  though  not  in  the  classic  sense  of 
the  term,  since  such  states  may  have  com¬ 
ponents  that  lie  in  the  nullspace  of 
WR(0,T)  and  so  cannot  be  reached. 

It  can  be  easily  shown  that  r(-)  varies 
continuously  with  both  the  system  parame¬ 
ters  A  and  B,  the  final  time  T,  and  the 
state  x. 

It  is  also  easily  seen  that  r(x)  is  a 
sublinear  function  of  x.  It  is  also  con¬ 
vex,  so  that,  in  particular,  the  set  of 
states  whose  reachability  measure  is  no 
greater  than  a,  i.e.,  {x:  r(x)  <  a},  is  a 
convex  set  for  all  a  >  0. 

The  corresponding  dual  observability  mea¬ 
sure  of  a  state  x  at  time  0  is  simply  the 
L.-norm  of  the  output  function  y  over  [ 0 , T J 
that  is  produced  by  x,  i.e., 

o(x)  =  |yi|  =  (x'MjjCO.TJx)**  (4) 

where  Mq(0,T)  is  the  observability  Gramian 

T  , 

Mq(0,T)  =  /  eA  tC,CeAtdt  (5) 


large  norm,  and  in  this  sense  x  is  highly 
observable . 

The  observability  measure  oi*j  varies 
continuously  with  the  system  parameters  A 
and  C,  the  final  time  T,  and  the  state  x. 
It  is  sublinear  and  convex;  the  set  of  all 
states  whose  observability  measure  is  less 
than  a  given  0  is  convex;  i.e.,  {x:  o(x)  < 
0}  is  convex. 

These  measures,  r(x)  and  o(x),  preserve 
suitable  duality  conditions.  It  is  easily 
checked  that  the  reachability  measure, 
r  (x),  of  the  system  dual  to  (1)  is  simply 
the  observability  measure  o(x)  of  (1). 
Conversely,  the  observability  measure, 
ou(x),  of  the  system  dual  to  (1)  is  the 
reachability  measure  of  (1). 

It  is  convenient,  in  order  to  avoid  the 
square  roots  in  (2)  and  (4),  to  work  with 
the  measures 

R(x)  =  |  r2(x)  =  i  x'Wr(0,T)x  (6) 

0(x)  =  |  o2(x)  =  |  x 1 Mq(0 ,T)x  (7) 

The  physical  interpretations  of  these  mea¬ 
sures  are  just  as  one  half  of  the  squares 
of  the  quantities  involved  in  defining  r(x) 
and  0(x) :  thus,  RCxj^is  one-half  of  the 
maximum  value  of  (z'x)"1  over  all  appropri¬ 
ate  z,  while  0(x)  is  one-half  of  the  square 
of  the  L~-norm  of  the  output  function  y 
over  [0,T f  that  is  produced  by  x. 

Another  reachability  measure  is  the  conju¬ 
gate  functional  of  R.  The  conjugate  func¬ 
tional  of  a  convex  functional  R  is  defined 
by  (see,  e.g.,  Luenberger,  1969), 

R*(z)  =  sup  [z'x  -  R(x) ]  (8) 
x 

Performing  the  indicated  maximization  with 
R(x)  given  by  (6)  we  find 

R*(z)  =  y  z’WR'l(0,T)z  (9) 

assuming  that  W  (0,T)  is  invertible;  if  it 
is  not,  R*(z)  is  still  defined  by  (8)  and 
takes  the  value  •  if  !  lies  in  tne  null- 
space  of  Wr(0,T).  For  simplicity  of  nota¬ 
tion,  we  assume  here  that  W  (0,T)  is  in¬ 
vertible. 

This  reachability  measure  (9)  has  a  simple 
interpretation  in  terms  of  the  system  (1): 
it  is  one-half  of  the  minimum  amount  of 
control  energy  needed  to  reach  state  z  at 
time  T  from  state  0  at  time  0,  i.e., 


We  note  that  o(x)  is  zero  iff  the  state  x 
is  unobservable  in  the  standard  sense  of 
the  term,  i.e.,  x  gives  rise  to  an  output 
that  is  identically  zero  over  (0,T|; 
equivalently,  x  lies  in  the  nullspace  of 
Mg(0,T).  Large  values  of  o(x)  mean  that  x 
gives  rise  to  an  output  function  y  with 


R*(z) 


min 


l  1 


J  u  ‘  ( t )  u  ( t )  d  t : 
0 


z 


0 


A(T-t) 

e 


But t )dt 


I 

I 


(  10) 
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This  minimum  energy  is  known  from  standard 
least-squares  linear  system  theory 

(Brockett,  L9  70)  to  be  just  z'W  iO,Dz, 
so  that  R*(2)  of  (9)  is  one-naif  this 
minimum  energy. 

Thus  the  measure  R*  given  by  i9)  is  in  fact 
a  measure  of  unreachability :  small  values 
of  R*  correspond  to  small  values  of  re¬ 
quired  input  energy,  and  thus  to  relatively 
easily  reached  states;  large  values  of  R* 
correspond  to  large  required  input  energies 
and  thus  to  states  that  are  more  difficult 
to  reach;  in  particular,  states  in  the 
nullspace  of  W_(0,T)  (and  thus  unreachable 
according  to  the  "classical"  definition) 
have  R*(z)  =  ».  We  observe  that  R*  is 
convex,  as  are  all  conjugate  functionals  of 
convex  functioaals. 

The  corresponding  conjugate  functional  of  0 
is 

0*(z)  =  i  z ' Mq~ 1 (0 ,T)z  (11) 

An  interpretation  of  this  in  terms  of  the 
system  (1)  follows  by  considering  the  prob¬ 
lem  of  minimizing  the  norm  of  the  linear 
functional  on  the  output  function  y  that 
produces  the  projection  (or,  more  general¬ 
ly,  the  inner  product)  of  the  initial  state 
Xg  on  the  vector  z.  Suppose  x^z  is  found 
as  the  linear  functional 

XqZ  -  J  w' (t)y( t)dt  ;  (12) 

the  function  w  of  minimum  L^-norm  that  ac¬ 
complishes  this  is 

w°(t)  *  CeAtM0‘l(O,T)z  (13) 

so  that 

\f  w0,  (t)w°(t)dt 
0 

=  |  z’Mq'^O.TJz  =  0*(z )  (14) 

Thus  0*  gives  a  measure  of  ^observability : 
Small  values  of  0* ( z )  correspond  to  a  small 
effort  to  determine  x^z  and  thus  to  a  "more 
observable”  z  than  do  large  values  of 
0*(z).  .Vote  that  0*(z)  =  »  if  i ;  is  in  the 
nullspace  of  Mq(0,T),  corresponding  to  a 
state  that  is  unobservable  in  the  standard 
sense  of  the  term.  As  with  all  our  mea¬ 
sures,  0*  is  convex. 

The  measures  0*  and  R*  also  preserve  appro¬ 
priate  duality  relations:  the  observabil¬ 
ity  measure  0  ^  of  the  dual  to  system  (1) 
is  the  reachability  measure  R*  of  system 
( 1 ) ,  and  vice  versa . 

As  a  means  of  making  more  concrete  the  re¬ 
lationship  between  R  and  R*,  observe  that 
Lf  W^(0,T)  is  diagonalized  by  the  similar¬ 


ity  transformation  T,  so  that  T'V  . 0,T)T  = 
\  =  diag  i A  ! ,  then 

R I  x )  =  4  x  '  W  i  0 , 1  j  x  =  y  iT'xi’  MT'xl 


whereas 

R*(x)  =  \  x'Wr'1(0,T)x 

=  1  (T‘x)1a"1(T'x)  =  i  ^  A^fT'x)2 

2  2  l  i 

i  =  l 

This  makes  clearer  the  earlier  observation 
that  R  is  a  measure  of  reachability,  where¬ 
as  R*  is  a  measure  of  unreachability. 

Entirely  analogous  measures  of  controlla¬ 
bility  and  recons  tructibi  1  ity  can  also  be 
constructed,  with  analogous  interpretations 
to  those  above. 

Controllability: 

C(x)  =  |  x'Vct0,T)x  (15) 

Uncontrol lability: 

C*(z)  =  j  z’Wg'^O.Dz  (16) 

where  W  (0,T)  is  the  controllability 
Gramian  (Brockett,  1970) 

W  (0,T)  =  hf e‘AtBB,e'A'£dt  (17) 

L  ^  0 

Large  values  of  C  correspond  to  easily- 
controlled  states,  as  do  small  values  of 
C*.  For  states  that  are  uncontrollable  in 
the  classical  sense,  i.e.,  in  the  nullspace 
of  Wc(0,T),  C(x)  =  0,  and  C*(x)  =  *. 

Reconst rue tibility : 

p(x)  =  J  x'MR(0,T)x  ( 18) 

Un reconstruct lbility : 

P*(z)  =  |  z '  Mj^"1  (0  ,T)z  (19) 

where  M^(0,T)  is  the  recons tructibility 
Gramian 

h  (0,T)  =  /e'A'  (T't)C'Ce‘A,T"t)dt  (20) 

R  0 

Easily-reconstructed  states  have  large 
values  of  p  and  small  values  of  p*.  States 
that  are  unreconstructible  in  the  classical 
sense,  i.e.,  in  the  nullspace  of  1R(0,T), 
have  p(x)  =  0  and  p*(x)  =  x 
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INTERACTION  AND  NON-INTERACTION 

We  now  turn  to  the  application  of  the 
measures  introduced  above  to  quantifying 
tne  degree  of  interaction  between  an  input 
and  an  output  of  the  system  (1).  We  empha¬ 
size  that,  in  this  context,  u  in  (1)  may  be 
simply  a  part  of  the  total  input  and  y  in 
(1)  may  be  simply  part  of  the  total  system 
output,  so  that  we  are  thinking  in  terms  of 
possible  applications  to  disturbance  rejec¬ 
tion  or  decoupling.  We  adopt  as  a  measure 
of  interaction  between  input  and  output  the 
number  I  defined  by 

I  =  sup  |  Ajf  y ' (t)y(t)dt:  y(t)  =  CeAtxQ, 

I  *  0 

°r  -At 

xQ  =  J  e  Bu(t )dt , 


\  J  u’ (t)u(t)dt  <  1  (21) 

i  -T  ' 

i.e.,  I  is  one  half  of  the  maximum  L_-norm 
of  the  output  function  y  over  [ 0 , T r  that 
can  be  produced  by  an  input  function  u  over 
[ -T , 0 )  with  one-half  of  the  L,-norm  of  u 
constrained  to  be  no  greater  ^.han  unity. 
Using  the  definition  of  0  in  (7)  via  (A), 
the  interpretation  of  R*  given  by  (10),  and 
the  observation  that  WR(-T,0)  =  WR(0,T)  for 

a  constant  system,  it  is  straightforward  to 
see  that 

I  =  sup{0(x) :  R*(x)  <  11  (22) 

One  intuitive  interpretatio»__of  this  ex¬ 
pression  is  that  large  interaction  between 
input  and  output  is  a  consequence  of  at 
least  some  states  whose  observability 
measure  0(x)  is  large  also  being  reasonably 
reachable  (in  the  sense  that  the  unreach¬ 
ability  measure  R*(x)  is  no  greater  than 
1).  If  all  states  having  high  observabil¬ 
ity,  as  measured  by  0(x),  also  have  low 
reachability  (as  measured  by  a  large  un¬ 
reachability  R*(x)),  then  input-output 
interaction  will  be  small.  That  interac¬ 
tion  between  input  and  output  should 
epend  on  the  reachability  and  observabil¬ 
ity  of  the  states  is  to  be  expected:  the 
above  expression  for  I  quantifies  this 
dependence  in  terms  of  specified  measures 
of  reachability  and  observability.  We  re¬ 
mark  that  states  x  for  which  the  supremum 
on  the  right  side  of  (22)  is  achieved  can 
be  interpreted  as  the  states  through  which 
maximum  input-output  interaction  takes 
place . 

The  corresponding  dual  expression  is 

I  =  s up { R ( x ) :  0*(x)  <  1}  (23) 

and  this  has  a  dual  interpretation  to  that 
above.  This  expression  follows  by  noting 
that  I  is  simply  one-half  of  the  square  of 


the  induced  norm  oi  the  linear  operator  L 
mapping  inputs  in  L,|-T,0|  to  outputs  in 
L  ( 0 , T  |  ;  L  can  be  detomposed  as  L  =  ^2^1' 
where  L,  maps  inputs  over  [ -T , 0 )  into 
states  at  time  0,  and  maps  states  at 
time  0  to  outputs  over  [0,T].  Because 

(Luenberger,  1969  ),  !|L'-ViI  =  IL^L^II  =  'lLi|  = 

llL_L.ll,  where  L*  is  the  adjoint  of  L  and 
corresponds  to  the  system  dual  to  (1),  we 
have  using  (22)  and  the  previously-noted 

dualities  0d(x)  =  R(x)  and  R  d(x)  =  0*(x), 

I  =  ||L*||2  =  sup{Od(x):  R*d(x)  <  1} 

=  sup{R(x) :  0*(x)  <  1 ! , 

which  is  (23) . 


BOUNDS  ON  ESTIMATION 
PERFORMANCE 

Consider  now  the  stochastic  system 

dxt  =  Axtdt  +  Bdv^  (24a) 

dz^  =  Cx^dt  +  dw^  (24b) 

where  v  and  w  are  independent  normalized 
Wiener  processes  that  are  independent  of 
the  initia_l  state  x^,  which  is  Gaussian 
with  mean  xQ  and  covariance  2 g.  The  well- 
known  Kalman  filtering  equations  for  x  - 
E[x  |2  ),2t  being  the  a-algebra  generated 
by  the  output  process  z  over  [0,t),  are 

dxt  =  Axtdt  +  2(t)C'[dzt  -  Cx^dt 1  ; 


where 

1  =  A2  -  IA'  -  1CCZ  +  BB'  ; 

HO)  =  20  (26) 


or,  alternatively, 

Z'1  =  ( — A ' )2‘l  +  2  1 

(- A)  -  Z^BB'l' 

■1 

+  C'C;  2_1(0) 

:27) 

A  number  of  upper  and 

lower  bounds 

on 

£(T)  in  terms  of  the 

reachability 

and 

reconstructibility  Gramians  can  be  derived 
using  arguments  similar  to  those  in  (Soren¬ 
son,  1968).  These  are  matrix-ordering 
bounds  of  the  form  P  >  Q,  meaning  that 
P  -  Q  is  non-negative  definite.  Included 
here,  for  example,  is  the  upper  bound 

Z(T)  <  MR'l(0,T)  +  Wr(0.T)  (28) 

from  which  we  have  immediately  that  the 
variance  a’I(T)a  of  a’x^,  x^  =  -  Xy,  is 
bounded  above  by 
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Varla’xT)  <  a'MR'l(0,T)a  ♦  a'WR(0,T)a 

=  2p* ( a )  ♦  2R(a )  (29) 

Thus  Var(a'x_)  is  guaranteed  to  be  small  if 
p*la)  is  small  (i.e.,  a  is  easily  recon¬ 
structed)  and  R(a)  is  small  (i.e.,  a  is 
relatively  unreachable  from  the  process 
noise  v),  an  intuitively  reasonable  result. 

A  lower  bound  on  2(T)  can  be  obtained  by 
applying  the  ab^ve  upper  bound  on  Z  to  the 
equation  for  I  In  this  case  we  obtain 

l'l(T)  <  WR 1 ( 0 , T )  ♦  MjCO.T)  (30) 

so  that. 

a ' Z" 1 (T)a  <  2R*( a )  ♦  2p(a)  (31) 

Thus  a'Z  ^ CT) a  will  be  large  if  R*(a)  is 
large  (i.e.,  a  is  difficult  to  reach  from 
v)  and  if  p(a)  is  large  (i.e.,  a  is  highly 
reconstructible ) ,  which  is  again  intuitive¬ 
ly  expected.  It  is  also  intuitively  rea¬ 
sonable  that  in  bounds  for  this,  a  filter¬ 
ing  problem,  reachability  and  reconstructi- 
bility  (rather  than  controllability  and 
observability)  are  involved.  For  example, 
one  expects  (and,  indeed,  can  show)  that 
observability  is  involved  in  smoothing 
problems . 

These  upper  and  lower  bounds  have  a  number 
of  simple  consequences,  some  of  which  we 
now  outline: 

Nonlinear  Observations 

It  is  shown  in  (Snyder  and  Rhodes,  1972) 
that  if  the  linear  observations  (24b)  are 
replaced  by  the  nonlinear  ones 

dzt  =  h(xt)dt  +  dwt  (32) 

where  h  is  continuously  differentiable, 
then  the  conditional  covariance  of  the 
state  given  the  observations  is  bounded 
below  according  to 

Cov(xT[2TJ  >  r (T) 

where 

f  =  at  ♦  ta’  -  rsr  +  bb'  ; 

and 

s  ■  #)'©! 

If  S  is  factored  as  C’C, 
is  seen  to  satisfy  the  Riccati  equation 
(26).  To  this,  the  lower  bound  (30)  can  be 
applied  in  turn  to  yield 

a'  (Cov|xt]2t))  *a  <  aT"1(T)a 

<  2R*(a)  +  2p(a )  (35) 

where  p(a)  might  be  thought  of  as  an  aver¬ 
age  reconstructibility  of  the  system  with 


(33) 

r(0)  =  ZQ  (34a) 

(34b) 

the  lower  bound  f 


nonlinear  observations  132),  since  it 
i nvo  Ives 


Mr(0,T)  =  f  e'A  (T't JSe 


-A’ ( T-t  J _  -A(T-t) 


dt  (36) 


where  S  is  given  by  (34b). 

Noise  Scaling 

If  h  in  (32)  is  replaced  by  ah  (or  C  in 
(24b)  is  replaced  by  aC),  corresponding  to 
decreasing  the  observation  noise  covariance 
by  a  ,  then  ttjj  is  replaced  by  ®  71^,  P  by 

a^p,  and  p*  by  a  ^p*.  Similarly,  if  B  is 
replaced  by^  38  to  reflect  an  increase  by  a 
factor  of  8  of  the  input -noise  covariance, 
then  WR  is  replaced  by  8  WR,  R  by  Bn  and 

R*  by  8  ^8*.  The  bounds  given  above  are 
affected  simply  by  these  noise  scalings, 
whereas  the  conditional  error  covariance 
(Z)  is  not. 


Additional  Observations 


Suppose,  in  addition  to  the  observations 
(32)  or  (24b),  we  also  have 

dzt  =  ii(xt)dt  +  dw^  (37) 

or 

dzt  =  Cx^dt  +  dwt  (38) 

where  w  is  a  normalized  Wiener  process 
independent  of  the  initial  state  and  all 
other  Wiener  processes  introduced  earlier. 
The  right  side  of  the  Riccati  equation  (26) 
then  has  an  additional  term  -ZC'CZ,  and  the 
lower  bound  T  given  by  (34)  an  additional 
term  -rST.  In  either  case,  the  solution  Z 
or  T  is  not  affected  simply  by  the  addi¬ 
tional  term.  On  the  other  hand,  it  is 
easily  seen  that  the  reconstructibility 
Gramian  for  the  combined  measurements  z  and 
z  is  simply  the  sum  of  the  reconstructibil¬ 
ity  Gramians  MR  and  MR  corresponding  to  the 
separate  observations.  Thus,  Z(T)  (or 
r ( T ) )  is  bounded  below  according  to 

Z’(T)  <  WR*^ (0 ,T)  ♦  Mr(o,T)  +  MR(0,T)  (39) 

and 

a ' Z*  (T)a  <  2R*(a)  +  2p(a)  ♦  2p(a)  (40) 

The  effect  of  the  additional  observation  on 
(here)  the  lower  bound  is  extremely  simple, 
permitting  immediate  assessment  of  the 
utility  of  additional  observations  or  com¬ 
parison  of  alternative  observations. 

Measurement  Design  Problems 

The  bounds  also  provide  a  simple  framework 
for  measurement  design.  For  example,  for  a 
fixed  vector  a  of  interest,  we  can  consider 
the  maximization  of  the  right  side  of  (31) 
with  respect  to  C.  This  is  equivalent  to 
maximizing 
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2p(a)  =  a'MR(0,T)a 


=  a'-fe 


-A'(T-t)c.Ce-A(T-t)dt  a 


=  rrace  c 


'/ 


-A(T-t)  .  -A(T-t)dtC 


trace 


|c'Cc(0,T)c| 


where,  (making  change  of  variables  s 
T  -  t), 


8(0, T)  =  / e‘Asaa’e'A’Sds 


is  the  controllability  Gramian  of  the 
system  (A,a,C).  If,  for  example,  C  is  a 
row  vector,  the  solution  to  this  optimiza¬ 
tion  problem  is  simply  to  take  the  vector 
C'  along  the  eigenvector  corresponding  to 
the  maximum  eigenvalue  of  0^,(0, T). 


CONCLUSIONS 

The  quantitative  measures  of  controllabil¬ 
ity,  observability,  reachability,  and 
reconstructibility  introduced  here  have 
desirable  continuity  and  duality  proper¬ 
ties.  Their  applications  include  measuring 
the  degree  of  interaction  between  system 
input  and  output,  and  providing  a  design- 
oriented  framework  for  evaluating  estima¬ 
tion  performance  in  terms  of  upper  and 
lower  bounds. 
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Abstract  —This  paper  presents  several  decentralized  algorithms  for  find¬ 
ing  all  shortest  paths  in  a  network.  Both  static  and  dynamic  networks  are 
discussed.  Each  algorithm  has  localized  information  and  communication 
requirements,  operates  asyndsonouslv,  and  converges  to  the  optimum  in 
finite  time. 

I.  Introduction 

IN  RECENT  years  a  major  area  of  research  has  been 
the  development  of  decentralized  decision,  estimation, 
and  control  schemes  for  large-scale  systems.  The  concept 
of  decentralization  has  applications  to  many  different  areas 
of  systems  theory.  For  example,  Lau  et  al.  [1]  developed  a 
decentralized  algorithm  for  solving  the  problem  of  maxi¬ 
mum  flow  through  a  network;  minimum-delay  routing  has 
been  considered  by  Gallager  [2];  Sandberg  [3]  developed  a 
decentralized  scheme  for  synchronizing  digital  transmission 
systems;  and  the  question  of  stability  for  large-scale  sys¬ 
tems  has  been  addressed  by  Siljak  [4],  among  others.  The 
reader  is  also  referred  to  the  April  1978  issue  of  the  IEEE 
Transactions  on  Automatic  Control,  which  was  de¬ 
voted  to  large-scale  systems,  and  in  particular  the  survey 
paper  by  Sandell  et  al.  [5]  and  the  references  therein. 

In  this  paper  we  are  concerned  with  the  decentralized 
shortest  path  problem  and  the  development  of  algorithms 
that  enable  each  node  in  a  network  to  calculate  its  shortest 
distance  to  any  other  node  using  only  local  knowledge  of 
the  network  topology  and  only  local  information  transfer 
between  adjacent  nodes.  Shortest  path  problems  arise  in 
many  contexts,  and  algorithms  with  decentralized  require¬ 
ments  are  of  obvious  importance  in  many  applications. 

Our  intention  has  been  to  develop  algorithms  general 
enough  to  be  applicable  in  a  variety  of  situations.  We  have 
had  the  same  goal  of  generality  for  the  network  models 
used  and  the  assumptions  made.  While  our  algorithms  have 
potential  application  to  a  wide  variety  of  specific  situa¬ 
tions,  including  computer-communication  networks,  we  do 
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not  consider  the  specific  characteristics  of  such  networks. 
We  also  point  out  that  in  some  situations  the  shortest  path 
problem,  which  we  discuss,  is  different  from  the  optimal- 
routing  problem. 

The  main  result  presented  in  this  paper  is  a  decentralized 
shortest  path  algorithm  for  dynamic  networks  in  which 
branch  lengths  can  increase  or  decrease,  and  branches  or 
nodes  can  be  added  to  or  removed  from  the  network.  This 
algorithm  operates  asynchronously,  has  local  information 
and  communications  requirements,  and  is  guaranteed  to 
converge  to  the  optimum  in  finite  time.  The  foundation  of 
this  algorithm  is  our  static  algorithm  [6],  which  is  based 
primarily  on  an  algorithm  of  Ford  and  Fulkerson  [7],  and 
which  is  similar  in  spirit  to  those  found  in  [8]-{10],  and 
elsewhere. 

The  static  algorithm  is  summarized  in  Section  II.  Section 
III  contains  the  main  version  of  the  dynamic  algorithm. 
Some  alternatives  to  and  modifications  of  the  dynamic 
algorithm  are  given  in  Section  IV;  for  some  of  these 
convergence  proofs  exist,  while  for  others,  which  appear 
intuitively  to  lead  to  more  rapid  convergence  or  greater 
robustness,  proofs  have  yet  to  be  found.  Finally,  conver¬ 
gence  proofs  and  precise  descriptions  of  the  static  algo¬ 
rithm  and  the  main  dynamic  algorithm  can  be  found  in 
Appendix  I. 

II.  The  Static  Algorithm 

Consider  a  directed  graph  consisting  of  N  nodes,  de¬ 
noted  A  =  { l,2,---,iV},  and  a  collection  of  links  (branches) 
L  —  {( i,  j):  i.  jE  A  and  there  exists  a  link  from  /  to  j). 
There  is  at  most  one  link  from  any  node  i  to  any  node  j. 
and  no  link  from  any  node  to  itself.  It  is  convenient, 
though  not  necessary,  to  assume  that  the  existence  of  a  link 
from  i  to  j  implies  the  existence  of  one  from  j  to  i.  In  some 
situations,  network  links  are  used  also  for  communication, 
but  it  may  be  that  network  links  carry  certain  commodities 
and  are  not  used  to  carry  algorithm-related  information. 
For  instance,  in  a  transportation  network  links  may  carry 
vehicles,  and  a  separate  communication  network,  such  as 
the  telephone  network,  may  be  used  for  transmitting  the 
information  required  by  the  algorithm.  In  that  case,  we 
assume  that  the  existence  of  a  pair  of  oppositely  directed 
network  links  connecting  two  nodes  implies  the  existence 
of  a  bidirectional  communication  link  between  them.  To 
each  link  ( /,  j )  is  associated  a  distance  j(  /.  j ).  which  could 
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represent  physical  distance,  delay,  energy,  money,  or  any 
otfier  quantity  appropriate  to  the  network  under  considera- 
tln.  The  lengths  s(i,  j)  and  s(j,  i)  may,  and  generally  do, 
anfer.  Lengths  are  unrestricted  in  sign,  but  to  guarantee 
the  existence  of  loopless  shortest  paths  the  sum  of  the 
1 1  gihs  in  any  closed  loop  is  assumed  to  be  positive. 

■Very  little  topological  information  is  needed.  Each  node 
needs  to  know  only  the  identity  of  its  neighbors,  the  length 
<  I  the  branch  to,  but  not  from,  each  neighbor,  and  the 
i  I  mtities  of  all  possible  destinations.  (Actually,  a  list  of  all 
destinations  need  not  be  known  initially.  It  is  possible  to 
( i  nstruct  the  list  during  the  course  of  the  algorithm.) 


A  The  Algorithm 

I  The  algorithm  itself  is  rather  simple.  For  each  destina- 
I  >n.  a  node  calculates  and  stores  an  assessment  of  the 
shortest  distance,  based  on  information  received  to  date, 


V 

X 

Distance  matrix  for  node  i 


j  i  each  of  its  neighbors.  The  smallest  of  these  is  taken  to 
I  its  assessment  of  the  shortest  distance  to  that  destina¬ 
tion.  and  is  subsequently  referred  to  as  the  current  shortest 
A1 stance .  Also  stored  is  the  identity  of  the  neighbor(s)  via 
j  uch  this  minimal  distance  is  achieved  [i.e.,  the  next 
riode(s)  in  the  shortest  path(s)].  Whenever  a  node’s  current 
shortest  distance  to  a  destination  decreases,  either  through 
I  itialization  or  new  information  received  from  a  neighbor, 
.i.e  node  transmits  this  distance  to  each  of  its  neighbors, 
thus  allowing  each  neighbor  to  update  the  corresponding 
stance  assessment  by  adding  the  distance  just  received  to 
e  known  length  of  the  link  to  the  sender.  Distance 
messages  from  a  node  to  any  of  its  neighbors  are  assumed 
be  transmitted  directly  along  the  (communication) 
anch  connecting  them,  and  received  in  the  order  in  which 
they  are  sent.  The  algorithm  continues  in  this  manner  until 
no  further  changes  can  be  made.  At  this  point,  which  must 
:cur  in  finite  time,  each  node  knows  the  length  of  the 
snortest  path  to  each  destination  (or  that  no  path  exists), 
the  next  node  in  this  path,  and  the  shortest  distance  via 
ich  alternative  neighbor.  A  precise  description  of  this 
_.gorithm  appears  in  the  Appendix. 

Convergence  of  the  Algorithm 

It  is  convenient  for  purposes  of  presentation  to  arrange 
the  data  stored  by  each  node  in  an  N  X  jV  matrix.  Fig.  1 
lows  the  matrix  maintained  by  node  /;  each  row  corre¬ 
sponds  to  an  ultimate  destination,  each  column  to  an 
adjacent  node  that  represents  the  next  node  in  various 
aths  to  the  final  destinations.  Row  /,  which  would  repre- 
:nt  distances  from  node  /  to  itself,  and  columns  corre¬ 
sponding  to  nodes  not  adjacent  to  i  are  crossed  out,  except 
'  >r  column  /.  which  serves  a  special  function  that  is  de- 
;ribed  subsequently.  The  quantity  d(t,  Ac;  j ),  stored  in  the 
(k.  j)  element,  is  the  distance  assessment  from  node  t  to 
node  k  via  next  node  j,  d(i,k)  =  min,d(/,  Ac;  j)  is  the  cur- 
I  ,*nt  shortest  distance  from  i  to  k.  and  n(  i,  k )  the 
ueighbor(s)  achieving  the  distance  d( i,  k );  both  d(i.  k )  and 
n(  i.  k  )  are  stored  in  the  Acth  element  of  column  /,  this  being 
le  special  role  of  the  ith  column.  Diagonal  elements 


represent  direct  distances,  i.e.,  lengths  of  links  from  node  i 
to  its  neighbors. 

In  Fig.  2,  the  distance  matrices  are  stacked  vertically  to 
form  a  distance  cube.  While  the  topological  and  distance 
information  necessary  to  construct  and  update  the  distance 
cube  is  not  localized,  so  that  the  cube  is  not  available  to 
any  node  of  the  network,  we  can  utilize  the  concept  of  the 
distance  cube  in  order  to  observe  and  analyze  the  behavior 
of  the  algorithm. 

Node  i’s  current  shortest  distance  to  final  destination  Ac, 
stored  as  d( i,  k )  in  the  (Ac,  i)  element  of  i's  distance  matrix, 
is  also  reflected  in  the  ( Ac,  i )  element  of  each  of  node  i ’s 
neighbors;  in  particular,  the  (Ac,t)  entry  in  node  j's  dis¬ 
tance  matrix  is  s(  j,  i)+  d{ i.  k ).  Transmission  of  any  change 
in  d(i.k)  must  therefore  be  reflected  in  the  distance  cube 
by  information  transfer  along  a  vertical  line  through  the 
(At,  i)  element  of  each  node’s  distance  matrix.  Thus,  com¬ 
munication  of  distance  information  occurs  along  vertical 
lines  in  the  cube.  Also,  in  a  given  node’s  distance  matrix, 
the  operations  in  any  row  are  self-contained  and  indepen¬ 
dent  of  those  in  any  other  row;  the  initialization  step  and 
each  subsequent  operation  of  updating  a  distance  assess¬ 
ment  and  comparing  it  to  the  current  shortest  distance 
involves  only  elements  in  a  specific  row.  Together,  these 
observations  reflect  the  fact  that  the  “  vertical  slice”  corre¬ 
sponding  to  each  fixed  destination  is  self-contained  insofar 
as  both  communication  and  addition-comparison  opera¬ 
tions  are  concerned.  Convergence  of  the  algorithm  is  most 
easily  proven  by  decomposing  the  distance  cube  into  verti¬ 
cal  slices  for  which  convergence  can  be  proven  separately 
and  individually.  The  following  is  an  outline  of  a  straight¬ 
forward  proof  for  the  static  algorithm  that  can  be  found  in 
[6],  The  Appendix  contains  a  detailed,  but  somewhat  dif¬ 
ferent.  proof  that  is  better  suited  to  the  subsequent  proof 
for  the  dynamic  algorithm. 

Consider  the  vertical  slice  corresponding  to  any  destina¬ 
tion,  say  node  k.  Any  node  whose  direct  path  to  k  is  a 
shortest  path,  has  this  distance  available  initially.  Any 
other  distance  assessment  for  destination  k  possessed  by 
the  node  must  be  greater  than  or  equal  to  this  shortest 
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Fig.  2.  Distance  cube. 


distance.  Although  the  node  is  unaware  that  the  direct 
distance  to  k  is  optimal,  it  must  find  the  distance  to  be  the 
minimal  element  in  row  k,  take  it  to  be  the  current  shortest 
distance  to  node  k,  and  transmit  this  distance  to  each  of  its 
neighbors.  Similarly,  a  node  that  has  a  shortest  path  con¬ 
taining  two  links  constructs  the  shortest  distance  to  k  after 
receiving  the  optimal  distance  from  the  intermediate  node 
in  the  path  (which,  by  the  principle  of  optimality,  must  be 
a  node  whose  direct  path  to  k  is  a  shortest  one),  and  takes 
that  distance,  again  unaware  of  its  optimality,  to  be  its 
current  shortest  distance.  In  this  manner,  knowledge  of 
shortest  distances  spreads  until  each  node  in  the  network 
has  found  the  shortest  distance  to  k.  The  same  is  true  for 
each  destination,  and  every  distance  in  the  distance  cube  is 
guaranteed  to  converge  to  the  optimum  in  finite  time. 

We  note  that  the  convergence  proof  focuses  only  on  the 
construction  and  transmission  of  optimal  distances,  ignor¬ 
ing  the  suboptimal  distances  which  may  be  transmitted 
prior  to  the  construction  of  shortest  distances.  This  focus¬ 
ing  on  optimal  information  brings  out  a  resemblance  be¬ 
tween  the  static  algorithm  and  dynamic  programming.  For 
a  given  destination,  think  of  the  destination  as  the  final 
dynamic  programming  stage.  The  set  of  nodes  that  have  a 
shortest  path  containing  one  link  comprises  the  next-lo-last 
stage.  The  stage  preceding  this  consists  of  all  nodes  having 
shortest  paths  with  exactly  two  links,  and  so  on.  (Note  that 
a  node  can  be  in  several  different  stages  if  it  has  more  than 
one  shortest  path.)  When  the  nodes  in  stage  n  have  found 
their  shortest  distances  to  the  destination  and  have  sent 
them  to  their  neighbors,  the  nodes  in  stage  n  —  1  are  then 
able  to  find  their  shortest  distances.  Thus,  the  static  algo¬ 


programming  works.  This  dynamic  programming  compari¬ 
son  is  appropriate  to  the  analysis,  but  not  the  actual 
operauon,  of  the  static  algorithm. 

C.  Convergence  Time 

Since  asynchronous  operation  of  the  static  algorithm 
allows  a  node  to  wait  indefinitely  before  transmitting  new 
information,  convergence  time  can  be  bounded  only  by 
bounding  this  delay.  Suppose  St  is  the  maximum  amount 
of  time  required  for  a  node  to  receive,  process,  and  relay  a 
distance  message.  Using  the  convergence  proof,  it  can  be 
seen  easily  from  the  discussion  above  that  a  node  having  a 
shortest  path  containing  the  maximum  possible  V  —  1  links 
must  possess  the  corresponding  shortest  distance  after  at 
most  ( N  —  2 )St  units  of  time,  and  this  is  therefore  an  upper 
bound  on  the  time  taken  for  each  node  to  know  its  shortest 
distance  to  every  other  node.  After  one  more  umt  of  time, 
every  node  knows  “everything,”  i.e..  each  element  of  the 
distance  cube  achieves  the  optimum  in  at  most  ( N  —  1  )St 
units  of  time.  Bounding  the  number  of  operations  is  much 
more  difficult.  A  more  detailed  discussion  of  convergence 
bounds  can  be  found  in  [1 1], 

III.  Dynamic  Networks 

While  the  above  algorithm  handles  static  networks,  many 
problems  arise  for  a  dynamic  network  in  which  branch 
lengths  may  decrease  or  increase,  new  branches  (nodes) 
may  be  introduced  into  the  network,  and  branches  (nodes) 
may  be  removed  from  the  network.  It  should  be  noted  that 
adding  or  removing  a  node  can  be  treated  by  simulta¬ 
neously  adding  or  removing  the  incident  set  of  branches. 
Again,  we  point  out  that  there  may  be  a  distinction  be¬ 
tween  network  links  and  communication  links.  If  so.  we 
assume  that  a  network  link  can  become  infinite  in  length 
without  being  removed  from  the  network,  provided  that 
the  corresponding  communication  link  remains  intact.  A 
network  link  can  be  removed  from  the  network,  in  which 
case  the  communication  link  is  also  removed.  Or  when 
adding  new  links  to  the  network,  a  communication  link  can 
be  activated  before  the  corresponding  network  link.  But  we 
assume  that  a  communication  link  cannot  be  removed  if 
the  corresponding  network  link  remains  functional. 

The  easiest  case  to  handle  is  that  of  decreasing  branch 
lengths.  The  algorithm  is  easily  modified  as  follows:  a  node 
detecting  a  decreased  branch  length  to  some  neighbor 
decreases  all  distance  assessments  via  that  neighbor  by  the 
amount  of  the  detected  change,  then  performs  a  compari¬ 
son  operation  in  each  affected  row  to  find  the  new  current 
shortest  distance  to  the  ultimate  destination  represented  bv 
that  row.  The  proof  is  very  similar  to  the  one  previouslv 
outlined.  Suppose  that  a  finite  number  of  branch  length 
decreases  occur  before  some  time  tf.  after  which  no  de¬ 
creases  occur  for  some  period  of  time.  The  topology  at  time 
tf  determines  the  set  of  shortest  distances  to  which  the 
algorithm  should  converge.  A  crucial  element  of  the  con¬ 
vergence  proof  for  the  static  algorithm  is  that,  at  any  time, 
each  distance  assessment  is  no  smaller  than  the  correspond- 
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Fig  3  Dynamic  network  for  which  static  algorithm  does  not  converge 

appears  in  the  distance  matrix,  it  is  the  smallest  element  of 
its  row  and  is  accepted  as  the  current  shortest  distance. 
Since  branch  length  decreases  can  cause  decreases,  but  not 
increases,  in  actual  shortest  distances,  this  condition  is  met 
for  the  shortest  distances  at  time  tf,  and  convergence  to 
these  values  follows  from  the  original  static  convergence 
proof. 

On  the  other  hand,  an  increased  branch  length  can  cause 
an  increased  actual  shortest  distance,  violating  this  condi¬ 
tion  and  invalidating  the  original  convergence  proof.  In 
this  case,  the  algorithm  may  not  converge  to  the  optimal 
solution  in  finite  time,  as  illustrated  by  the  example  in 
Fig.  3. 

For  this  example,  we  restrict  attention  to  destination  3, 
and  assume  that  nodes  1  and  2  have  converged  to  the 
obvious  shortest  distances  of  4  and  5,  respectively.  Addi¬ 
tionally,  node  2  has  optimal  distance  of  7  via  node  1. 
Suppose  that  branch  length  r(2.3)  increases  to  infinity. 
Node  2,  observing  this  increase,  would  change  its  current 
shortest  distance  to  the  distance  of  7  via  node  1,  and 
transmit  this  new  data  to  1.  (Note  that  the  actual  shortest 
distance  of  infinity  is  possessed  by  node  2,  but  the  distance 
assessment  7  is  smaller  than  this  shortest  distance.)  The 
current  shortest  distance  for  node  1  then  becomes  8,  and  is 
sent  to  node  2.  This  increases  the  shortest  distance  for  node 
2  to  10,  which  is  then  sent  to  node  1.  etc.  The  distance 
assessments  to  node  3  for  both  nodes  1  and  2  increase 
without  bound,  but  do  not  reach  the  correct  value  of 
infinity  in  fimte  time. 

One  could  remedy  this  problem  by  establishing  a  maxi¬ 
mum  possible  shortest  distance.  Any  distance  exceeding 
this  maximum  could  be  taken  as  infinity.  Nevertheless,  the 
convergence  time  of  the  static  algorithm  can  be  very  large 
when  increases  occur,  as  in  the  example  above. 

This  example  illustrate  the  fact  that  a  single  topological 
change  can  prevent  convergence  of  our  static  algorithm. 
Even  for  the  dynamic  algorithms  that  we  have  developed  to 
accommodate  topological  changes,  convergence  is  not  pos¬ 
sible  if  these  changes  occur  with  such  frequency  that  the 
optimal  solution  always  changes  before  the  algorithm  has 
time  to  converge  to  a  solution.  Therefore,  we  must  assume 
for  each  of  our  dynamic  algorithms  that  topological  changes 
cease  for  some  period  of  time,  sufficiently  long  to  allow 
convergence  to  take  place,  after  which  topological  changes 
may  resume.  A  sufficient  period  without  topological  change 
would  be  2(  N  —  1  )St.  where  St  is  as  defined  in  the  previous 
section. 

We  point  out  that  our  definition  of  convergence  may 
differ  from  that  used  elsewhere.  For  instance,  suppose  each 
distance  in  a  network  oscillates  between  two  values  as  a 
function  of  the  routing  scheme  being  used,  as  in  an  exam¬ 
ple  presented  by  Bertsekas  (13)  in  the  context  of 
computer-communication  networks.  Though  the  set  of 
'honest  paths  oscillates  between  two  solutions,  we  consider 


any  algorithm  that  can  find  these  shonest  paths  to  be 
convergent. 

Since  the  performance  of  the  static  algonthm  is  poor  for 
dynamic  networks,  we  have  developed  several  modified 
versions  of  the  static  algorithm,  some  of  which  are  de¬ 
scribed  in  this  paper.  Each  of  these  algorithms  is  based 
primarily  on  some  form  of  reinitialization  of  the  basic 
static  algorithm;  they  differ  mainly  in  the  mechanics  of  the 
reinitialization.  The  remainder  of  this  section  is  devoted  to 
our  main  dynamic  algorithm.  The  algorithm  contains  a 
procedure  to  treat  branch  length  increases  and  removal  of 
branches  and  a  procedure  for  adding  new  links. 

A.  Assumptions 

For  the  dynamic  case,  we  make  the  following  assump¬ 
tions  about  the  network  and  the  performance  of  the  indi¬ 
vidual  nodes. 

Assumption  1:  There  exist  times  /'  and  tf  such  that  any 
finite  number  of  topological  changes  can  occur  between  /' 
and  if.  but  that  none  occurs  for  some  period  of  time  after 

'/• 

Assumption  2:  Each  node  knows  the  distance  to  each  of 
its  neighbors  at  all  times. 

Assumption  3:  At  t\  the  distance  assessment  from  i  to  k 
via  j  is  greater  than  or  equal  to  the  sum  of  the  length  of 
link  (i,  j)  and  the  current  shortest  distance  d(j,  k )  for  any 
choice  of  /,  k.  and  j.  That  is,  node  i  cannot  have  knowledge 
of  a  better  distance  from  j  to  k  than  node  j  has. 

Assumption  4:  At  each  distance  assessment  d(i.  k:  j) 
is  no  smaller  than  the  actual  shortest  distance  from  i  to  k. 

Assumption  5:  There  exists  an  upper  bound  At.  known 
to  each  node  in  the  network,  on  the  amount  of  time  it  can 
take  a  message  to  traverse  any  loopless  path  in  the  net¬ 
work,  then  traverse  one  more  link.  A  possible  choice  for  A  t 
would  be  N  times  the  maximum  possible  processing  and 
transmission  time  required  for  a  message  to  be  sent  be¬ 
tween  neighbors.  Note  that  calculating  A t  may  require 
some  centralized  information,  although  a  decentralized 
scheme  for  determining  A  t  could  probably  be  developed. 

Assumption  6:  Each  node  is  equipped  to  measure  elapse 
of  time,  although  a  time  base  common  to  all  nodes  is  not 
required. 

B.  The  Local  Reinitialization  Procedure 

It  is  not  sufficient  to  merely  disseminate  a  reinitializa¬ 
tion  command  throughout  the  network  when  a  branch 
length  increase  or  removal  of  a  branch  occurs.  Once  a  node 
has  reinitialized,  it  needs  some  guarantee  that  subsequently 
received  information  is  based  on  all  affected  nodes  having 
reinitialized  also.  Thus,  a  mechanism  has  been  devised  for 
each  node  to  determine  which  distance  information  it 
receives  is  trustworthy  (in  that  all  nodes  further  down  the 
corresponding  path  are  guaranteed  to  have  reinitialized) 
and  which  is  not.  In  simple  terms,  on  hearing  that  a  branch 
length  increase  or  branch  removal  has  taken  place,  a  node 
ignores  distance  information  (but  not  other  messages)  sent 
by  any  questionable  neighbor  until  that  neighbor  acknowl¬ 
edges  that  it.  too.  is  aware  of  the  change.  As  each  neighbor 
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in  turn  so  acknowledges,  the  embargo  on  its  information  is 
removed.  In  this  way,  some  convergence  toward  the  new 
solution  can  be  taking  place  while  news  of  the  change  is 
still  propagating  through  the  network. 

More  precisely,  a  local  reinitialization  message  is  initiated 
by  a  node  detecting  an  increase  in  length  or  removal  of  any 
link.  When  this  occurs,  the  initiator  assigns  a  unique  index 
to  the  generated  message  (consisting  of  the  initiator’s  iden¬ 
tity  and  a  counter),  and  does  the  following: 

Step  l:  Reinitializes  by  eliminating  every  distance  as¬ 
sessment  in  memory  except  for  direct  distances,  and  by 
determining  new  current  shortest  distances. 

Step  2:  Places  an  embargo,  with  the  proper  index,  on 
distances  received  from  each  of  its  neighbors. 

Step  3:  Transmits  all  of  its  current  shortest  distances  to 
each  of  its  neighbors,  along  with  the  indexed  local  reini¬ 
tialization  message. 

A  node  other  than  the  initiator  takes  similar  action  upon 
receiving  a  particular  local  reinitialization  message  for  the 
first  time.  Each  embargo  placed  by  a  node  due  to  a  local 
reinitialization  is  removed  as  the  local  reinitialization  mes¬ 
sage  is  received  from  the  corresponding  embargoed  neigh¬ 
bor.  Specifically,  when  a  message  is  received  from  a 
neighbor,  a  node’s  action  is  governed  by  the  following: 

Case  1:  Message  contains  no  local  reinitialization. 

a)  If  there  is  no  embargo  on  this  neighbor,  calculate 
the  new  distance  assessments  via  this  neighbor,  perform 
addition-comparison  operations,  and  transmit  any  new 
current  shortest  distances  to  each  neighbor. 

b)  Otherwise,  ignore  the  message. 

Case  2:  Message  contains  a  local  reinitialization. 

a)  If  there  is  no  embargo  with  matching  index  already 
on  this  neighbor,  calculate  the  new  distance  assessments 
via  this  neighbor  (unless  another  embargo  is  on  the  neigh¬ 
bor.  in  which  case  distance  calculations  are  omitted),  and 
perform  Steps  1  -3  above,  except  that  the  distance  assess¬ 
ments  from  the  neighbor  just  heard  from  are  not  deleted  or 
embargoed. 

b)  If  there  is  a  matching  index,  remove  that  embargo 
from  this  neighbor.  If  no  other  embargo  exists  on  the 
neighbor,  take  action  as  in  Case  la).  Otherwise,  ignore  the 
distance  component  of  the  message  and  take  no  further 
action. 

Any  finite  number  of  local  reinitializations  can  exist 
within  the  network  simultaneously,  each  distinguishable  by 
its  index.  A  branch  removal  is  treated  by  eliminating  the 
appropriate  node  from  the  set  of  neighbors. 

C.  The  Procedure  for  Adding  Links 

The  above  procedure  alone  would  be  sufficient  if  the 
addition  of  new  links  and  nodes  did  not  occur.  The  crucial 
goal  of  the  procedure  is  to  guarantee  to  each  node  that  any 
neighbor  affected  by  an  increase  has  reinitialized  before 
distance  information  from  it  is  accepted.  If  a  node  sends  a 
local  reinitialization  message  to  each  of  its  neighbors,  then 
acquires  a  new  neighbor  through  the  activation  of  a  new 
link,  there  would  be  no  guarantee  that  this  new  neighbor 
had  recently  reinitialized  or  that  its  distance  information 


To  eliminate  this  problem,  suppose  that  two  nonadjacent 
nodes  wish  to  activate  a  new  pair  of  oppositely  directed 
links  between  them,  and  that  they  make  initial  contact  for 
this  purpose  at  the  time  tm.  (If  communication  links 
differ  from  network  links,  til’s  contact  is  made  possible  by 
activating  the  communication  link  before  the  network  link 
is  added.  In  any  event,  we  assume  that  some  means  of 
communication  exists  in  this  situation.)  Then,  in  order  to 
activate  these  links,  each  node  executes  the  following  steps: 

Step  1:  During  the  interval  [z^,  t^  +  St ),  where  St  is 
the  maximum  possible  transmission  delay  between  any  pair 
of  nodes,  as  defined  in  Section  1 1- A,  store  the  index  of  each 
local  reinitialization  message  received  or  initiated.  Also,  in 
consultation  with  the  tentative  new  neighbor,  determine 
which  of  the  two  will  initiate  a  local  reinitialization  mes¬ 
sage  when  the  links  are  activated. 

Step  2:  At  time  ■+•  St,  activate  the  new  links,  and 
add  the  new  neighbor  to  the  set  of  neighbors.  Then,  if  this 
node  is  the  initiator  agreed  upon  in  Step  1.  initiate  a  local 
reinitialization  message. 

Step  3:  During  the  interval  +  St.  t^  +  2Ar),  com¬ 
pare  the  index  of  any  local  reinitialization  message  received 
from  the  newly  acquired  neighbor  to  the  indexes  stored  in 
Step  I.  If  this  index  is  m  the  set,  do  not  reinitialize.  Rather, 
transmit  the  local  reinitialization  back  to  the  new  neighbor, 
then  proceed  as  in  Case  1  of  the  previous  section.  If  this 
index  is  not  in  the  set.  proceed  with  the  normal  reimualiza- 
tion  steps. 

Step  4:  At  time  t^  +  2A /.  remove  the  indexes  stored  in 
Step  1  from  memory,  compietmg  the  new- link  procedure. 

The  waiting  period  of  St  and  the  storing  of  indexes 
received  during  this  period  serve  to  prevent  a  particular 
local  reinitialization  message  from  causing  any  node  to 
reinitialize  more  than  once,  a  helpful  fact  in  proving  con¬ 
vergence.  The  validity  of  distance  information  exchanged 
by  the  two  new  neighbors  is  guaranteed  by  the  local 
reinitialization  message  initiated  upon  activation  of  the 
new  links. 

The  two  procedures  just  described,  one  for  initiating 
local  reinitialization  messages  and  one  for  adding  new 
links,  are  the  major  components  of  our  main  dynamic 
algorithm.  This  algorithm  is  described  in  detail  in  the 
Appendix. 

D  Convergence 

The  most  important  property  of  the  dynamic  algorithm 
is  the  guarantee,  given  our  assumptions,  that  it  will  con¬ 
verge  to  the  optimal  solution  of  the  shortest  path  problem, 
regardless  of  the  number  and  type  of  topological  changes 
that  occur.  The  convergence  proof,  which  appears  in  detail 
in  the  Appendix,  is  outlined  here. 

After  the  time  tf.  defined  in  Assumption  1.  the  topology 
of  the  network  is  fixed  for  some  period  of  time,  and 
consists  of  one  or  more  components,  i.e..  maximal  con¬ 
nected  subnetworks.  Consider  any  component.  If  no  node 
in  this  component  has  ever  initiated  a  local  reinitialization 
message,  then  the  algorithm  behaves  like  the  static  one 
with  respect  to  this  component.  Otherwise,  there  exists  a 
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/ .  initiated  within  the  component.  The  component  must  be 

Ionnected  at  this  initiation  time,  guaranteeing  that  each 
ode  of  the  component  receives  the  message  and  reinitial¬ 
izes  after  the  final  initiation  time.  Since  no  further  in¬ 
creases  or  failures  can  occur  in  the  component  (since  it  is 
Issumed  that  no  further  local  reinitializations  are  gener¬ 
ated).  and  since  reinitialization  is  guaranteed  for  each 
component  node  at  a  time  after  which  only  decreases  in 

(istances  can  occur,  we  can  prove  the  existence  of  a  time 
fter  which  each  distance  assessment  in  the  component  is 
no  smaller  than  the  corresponding  shortest  distance.  The 
ime  is  true  for  each  component. 

Then  it  is  shown  that  once  an  embargo  is  placed  on  a 
neighbor,  it  must  be  removed  either  through  a  reply  from 
j'lat  neighbor  or  removal  of  the  neighbor  from  the  set  of 
I  eighbors.  Also  proven  is  that  once  an  embargo  is  lifted,  it 
cannot  be  replaced.  Thus,  there  exists  a  time  after  which  all 
embargoes  in  the  network  have  been  removed.  This  re- 
|  ioval  of  all  embargoes  guarantees  to  each  node  the  availa¬ 
bility  of  a  distance  assessment  via  each  neighbor. 

Finally,  once  all  embargoes  have  been  removed  and 
istance  assessments  are  no  smaller  than  corresponding 
lortest  distances,  the  dynamic  algorithm  behaves  like  the 
stauc  one.  and  convergence  to  the  optimal  solution  follows 
om  the  guaranteed  convergence  of  the  static  algorithm. 
Convergence  for  this  dynamic  algorithm  is  guaranteed  to 
occur  within  2(W-l)8r  units  of  time  after  the  “final” 
•ipological  change;  it  can  take  (N-l)St  units  for  each 
ode  to  have  reinitialized  and  an  additional  ( N  -  1  )St  units 
lor  convergence  to  the  optimum  for  all  possible  destina¬ 
tions.  One  would  expect  average  convergence  time  to  be  far 
ipenor  to  this  worst  case  bound. 

The  dynamic  algorithm  requires  more  memory  than  the 
stauc  one  in  order  to  keep  track  of  embargoes  and  indexes, 
requires  that  nodes  activating  new  links  have  clocks  and 
tow  ledge  of  Ar  determination  and  dissemination  of  which 
may  require  some  centralized  information,  although  A t 
juld  probably  be  determined  in  a  decentralized  fashion, 
ut  aside  from  the  knowledge  of  A r.  the  algorithm  is 
informationally  decentralized,  requires  only  local  com¬ 
munication.  is  implemented  asynchronously,  and  is 
iaranteed  to  converge  to  opumality. 

IV.  Alternative  Algorithms 

Each  of  the  following  alternative  algorithms  requires, 
although  each  for  a  different  purpose,  occasional  use  of 
hat  we  refer  to  as  a  global  reinitialization  procedure.  This 
rocedure  brings  about  global  reinitialization  by  effectively 
suspending  all  communication  of  distance  information  for 
sufficiently  long  period  of  time  (i.e.,  A t )  to  ensure  that  all 
odes  have  reinitialized.  Several  possible  mechanisms  for 
achieving  this  present  themselves;  one  is  for  the  initiator  of 
1  he  action  to  decide  upon  a  future  time  at  which  communi- 
ltion  of  distance  information  based  on  reinitialized  data 
•sill  resume,  and  to  send  this  time  to  each  of  its  neighbors, 
which  continue  to  propagate  the  message  throughout  the 
etwork.  Implicit  here  is  the  existence  of  a  time  base 
immon  to  all  nodes,  and  the  availability  to  each  node  of 
the  value  of  A t.  which  may  or  may  not  require  some  global 


information,  as  mentioned  earlier.  (Other  mechanisms  exist 
for  which  a  common  time  base  is  unnecessary.) 

While  the  message  to  reinitialize  is  received  by  different 
nodes  at  different  times,  there  is  a  simultaneous  resump¬ 
tion  of  communication  ot  distance  information,  hence  the 
reinitialization  is  effectively  a  global  one.  This  differs  greatly 
from  the  local  reinitialization  of  the  previous  section.  There, 
a  node  reinitializes  upon  receipt  of  a  message  to  do  so.  and 
begins  transmitting  distance  information  immediately 
thereafter. 

Perhaps  the  most  practical  utilization  of  the  global  re¬ 
initialization  procedure  is  as  a  replacement  for  the  new-link 
procedure  in  the  previous  dynamic  algorithm.  The  proce¬ 
dure  is  initiated  by  either  of  the  nodes  wishing  to  bring  up 
the  new  link.  By  empowering  the  global  reinitialization  to 
eliminate  embargoes,  when  communications  resume  at  the 
agreed  upon  time  each  node  receiving  the  global  reinitiali¬ 
zation  message  has  removed  all  embargoes,  and  has  no 
distance  assessments  except  for  direct  distances.  Some 
nodes  will  not  receive  this  message  if  the  network  is  not 
connected,  but  they  will  be  unaffected  by  the  new  link. 
Thus,  the  new  link  is  added  without  allowing  an  old  local 
reinitialization  message  to  be  transmitted  over  it.  pre¬ 
venting  a  second  reinitialization  by  a  node  that  received  it 
previously,  and  preventing  incorrect  distance  information 
from  being  exchanged  by  the  new  neighbors.  This  alterna¬ 
tive  algorithm  can  be  proven  to  converge  to  optimality. 

Another  alternative,  also  proven  to  converge,  involves 
superimposing  an  acknowledgment  system  on  the  local 
reinitialization  procedure,  which  is  then  used  to  simplify 
the  introduction  of  new  links.  In  this  algorithm,  when  a 
node,  say  i.  first  receives  any  particular  local  reinitializa¬ 
tion,  it  remembers  the  identity  of  the  node  sending  the 
message,  that  node  being  referred  to  as  the  upstream 
neighbor  for  this  reinitialization.  Each  node  that  first  re¬ 
ceives  the  message  from  node  i  is  a  downstream  neighbor  of 
i,  and  a  tree  is  created  as  the  message  spreads  downstream. 
(Actually,  the  same  type  of  tree  is  implicitly  generated  in 
the  dynamic  algorithm  of  Section  III.)  When  a  node  has  no 
downstream  neighbor,  either  because  it  has  no  neighbors 
besides  the  upstream  one  or  because  all  of  its  other  neigh¬ 
bors  have  already  joined  the  tree,  or  when  a  node  has 
received  a  special  acknowledgment  from  each  of  its  neigh¬ 
bors.  it  then  sends  special  acknowledgment  to  its  upstream 
neighbor.  Embargoes  are  placed  and  removed  as  before, 
but  the  index  of  each  local  reinitialization  is  remembered 
until  a  node  is  told  otherwise.  This  occurs  when  the  initia¬ 
tor  has  received  a  special  acknowledgment  from  each  of  its 
neighbors,  reflecting  the  fact  that  every  node  connected  to 
the  initiator  by  some  path  has  received  the  local  reinitiali¬ 
zation.  The  initiator  then  generates  a  message  that  is 
propagated  through  the  network  to  erase  the  tree  and  all 
memory  of  the  corresponding  index. 

The  new-link  mechanism  for  this  algorithm  is  very  simi¬ 
lar  to  the  one  presented  earlier,  except  that  instead  of 
remembering  indexes  only  duing  the  interval  [;nfw.  tnn  t 
At),  the  necessary  indexes  are  already  in  memory  at  infw. 
thus  eliminating  the  delay  of  At  when  activating  a  new  link. 
As  before,  a  local  reinitialization  is  generated  when  the  link 
is  activated. 
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The  global  reinitialization  procedure  is  used  in  this  algo¬ 
rithm  as  an  emergency  action.  A  removal  of  a  link  or  node 
of  a  tree  can  disrupt  the  flow  of  acknowledgments  to  the 
initiator,  or  of  “erasure"  messages  from  the  initiator.  To 
guarantee  the  eventual  removal  of  all  memory  of  indexes  in 
the  event  of  removal  of  a  tree  branch,  the  global  reini¬ 
tialization  procedure  is  invoked.  Without  this  action, 
convergence  would  still  occur,  but  some  indexes  may  be 
remembered  indefinitely,  long  after  they  are  no  longer 
needed. 

In  addition  to  the  disadvantage  of  requiring  the  use  of 
the  global  reinitialization  procedure  in  this  algorithm,  addi¬ 
tional  memory  is  required  for  each  node  to  remember  all 
indexes  received,  when  actually  indexes  need  be  remem¬ 
bered  only  when  adding  a  new  link  and  only  by  the  two 
nodes  adding  the  link. 

On  the  other  hand,  it  is  an  algorithm  that  is  guaranteed 
to  converge  to  optimality  in  a  dynamic  network.  Also,  we 
feel  that  the  concept  of  the  acknowledgment  system  has 
potential  value  in  other  network  situations  where  a 
guarantee  that  every  node  has  received  some  message  may 
be  required. 

This  alternative  algorithm  is  similar  to  one  developed  by 
Merlin  and  Segall  [12]  in  that  both  utilize  acknowledgment 
systems,  but  many  differences  between  the  two  algorithms 
exist.  A  major  difference  is  that  in  our  algorithm  the  node 
detecting  the  topological  change  generates  the  reinitializa¬ 
tion:  in  theirs  each  destination  controls  the  updating  of 
paths.  To  solve  the  shortest  path  problem  for  all  desti¬ 
nations.  the  Merlin/Segall  algorithm  would  apparently 
require  that  messages  requesting  update  be  sent  to  all 
possible  destinations  after  each  critical  topological  change. 

The  upper  bound  on  convergence  time  for  each  of  the 
alternative  algorithms  is  2(  N  -  1  )Si  after  the  “  final”  change, 
as  was  the  case  for  the  main  dynamic  algorithm.  An 
additional  (iV-l)6i  may  be  required  in  the  acknowledg¬ 
ment  algorithm  before  all  trees  are  removed. 

Several  modifications  have  been  developed  for  the  algo¬ 
rithms  presented  thus  far.  Information  storage  require¬ 
ments  can  be  reduced  by  storing  current  shortest  distances, 
but  not  distance  assessments.  The  assumption  that  network 
links  are  bidirectional  can  be  relaxed  and  the  algorithms 
modified  accordingly.  The  number  of  reinitializations  gen¬ 
erated  can  possibly  be  reduced  significantly  by  requiring  a 
node  to  generate  such  action  only  when  a  link  length 
increase  satisfying  certain  conditions  occurs.  Alternative 
procedures  for  adding  new  links  have  also  been  developed. 
These  modifications  and  others  are  discussed  in  [1 1], 

V.  Conclusions 

Our  goal  in  this  paper  has  been  to  present  an  application 
of  decentralized  information  and  control  to  the  solution  of 
the  shortest  path  problem.  The  algorithms  that  we  have 
developed  to  solve  this  problem  share  several  desirable 
properties. 

Each  algorithm  operates  asynchronously;  a  node  acts 
only  when  the  situation  warrants  action.  Moreover,  each 
.ileomhm  can  be  imolemented  without  even  the  need  for  a 


time  base  common  to  every  node  in  the  network.  Different 
nodes  can  have  different  capacities  for  processing  and 
transmitting  algorithm-related  messages.  These  messages 
can  be  acted  upon  and  subsequently  transmitted  by  the 
various  nodes  in  any  order. 

The  communication  requirements  of  each  algorithm  are 
totally  decentralized,  the  topological  information  require¬ 
ments  decentralized  except  for  knowledge  of  A t\  A t  can 
probably  be  determined  in  a  decentralized  fashion,  a  possi¬ 
bility  that  is  currently  under  investigation.  The  highly 
localized  nature  of  the  algorithms  limits  their  computa¬ 
tional  efficiency.  Efficiency  can  be  improved  by  giving 
nodes  access  to  additional  information,  as  in  the  algorithm 
in  [14],  for  instance,  in  which  each  node  is  allowed  to 
construct  the  entire  network  topology.  Perhaps  this  degree 
of  efficiency  can  be  matched  or  exceeded  by  algorithms 
requiring  only  some  centralized  information. 

The  most  important  property  shared  by  our  algorithms 
is  that  each  can  be  proven  to  converge  to  the  optimum  in 
finite  time.  This  contrasts  with  most  other  applications  of 
decentralized  control  in  which  suboptimal  performance  is 
the  cost  of  decentralization. 

Appendix 

Notation 

The  following  notation  and  definitions  are  used  in  the 
proofs  and  flowcharts  presented  in  this  Appendix: 

d,{i,  k;  j)  The  distance  assessment  from 

/  to  k  via  j  at  time  t. 

d,(i.  k)  =  min; d, (i,  k\  j)  The  current  shortest  distance 
from  /  to  k  at  time  t. 

n,(i,k)  The  next  node(s)  achieving  the 

distance  d,(i,  k). 

s,(i,  j)  The  length  of  link (i,  j)  at  time 

i. 

d*(i,k)  The  actual  shortest  distance 

from  /  to  k  at  tune  t. 

d*(i,  k:  j )  The  final  shortest  distance 

from  i  to  k  via  j. 

dj(i.  k)  The  final  shortest  distance 

from  f  to  k. 

sf(i.  j)  The  final  length  of  link  (/.  j). 

A  t  An  upper  bound  on  the  maxi- 

•  mum  possible  time  for  a  mes¬ 

sage  to  traverse  any  loopless 
path  plus  one  more  link  (which 
may  form  a  loop). 

counter  ( / )  The  index  counter  last  used  by 

node  i  in  generating  a  local 
reinitialization  message. 

Convergence  of  the  Static  Algorithm 

The  following  convergence  proof  for  the  static  algorithm 
given  in  Fig.  4  is  designed  to  facilitiate  its  application  to 
proving  convergence  for  the  dynamic  algorithm.  A  detailed 
version  of  the  simDler  static  convenience  proof  that  is 
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Proof:  Proof  is  accomplished  by  proving  a  series  of 
lemmas.  The  conditions  for  Theorem  1  are  assumed  to  hold 
for  each  of  these  lemmas. 

Lemma  1:  For  any  d(i,k),  there  exists  a  time  tlk  such 
that 


d,Ji,k)  =  df(i.k).  (1) 

Proof:  If  dT(i,  k)  =  df(i,  k),  then  ( 1 )  is  true  for  t,k  =  T. 
Otherwise,  by  Condition  4, 

dT(i.k)>dJ(i,k).  (2) 

This  implies  that  d*(  i ,  k )  is  finite,  which  implies  the  ex¬ 
istence  of  at  least  one  path  from  i  to  k.  Since  there  are  a 
finite  number  of  paths  from  i  to  k  in  the  network,  there 
must  be  a  shortest  path.  Suppose  one  such  shortest  path  is 

i  —  m0  —  mt  —  m2  mp  =  k.  (3) 

By  the  principle  of  optimality,  for  any  mr  in  (3), 
mr  -  m,+  1 - — ■  mp  —  k 

is  a  shortest  path  from  mr  to  k.  In  particular,  the  direct 
path  from  mp_ ,  to  k  is  a  shortest  path,  and 

d*{m  „k)<dT(mp_t,k)  =  min  dT{m j) 


J 


Fig  4  Actions  taken  bv  node  i  during  static  algorithm  .  ,  ,  \  \  ,  \ 

<dT(mp_l,klk)  =  sT{mp_l,k)  =  df(mp_llk) 


utlmed  in  Section  II  of  this  paper  can  be  found  in  [6], 
Theorem  1:  Suppose  that  the  static  algorithm  is  used  at 
11  times  after  some  time  T,  regardless  of  what  may  have 
een  used  before  T.  and  that  the  following  conditions  are 
satisfied. 

Condition  1:  No  topological  or  distance  changes  occur  in 
le  network  after  ume  T. 

Condition  2:  d:( i,k,  k)  =  sf( i.  k )  for  any  t  >  T.  and  for 
any  i  and  k. 

Condition  3:  d,(  i ,  k:  j)z*  s,(  i,  j)  +  d,{  j,  k )  for  any  t  s»  T, 
nd  for  any  /.  k.  and  j.  k  =*=  j.  (If  d,(i.  k\  j)  is  acquired  by 
node  i  by  some  means  other  than  adding  observed  length 
.11.;)  to  the  distance  d,(  j,  k )  received  from  node  j,  then 
us  condition  guarantees  that  d,(i,  k:  j)  cannot  increase  as 
the  result  of  information  received  from  node  ;.  In  other 
ords.  node  /  cannot  have  access  to  better  information  on 
distance  via  node  j  than  node  j  itself  has.  This  condition 
'a  required  to  guarantee  that  rf,(i.  k )  =  min^O,  k;  j )  in 
he  convergence  proof.) 

Condition  4:  d,{  i,  k:  j)>  d*(  i,  k )  for  any  t  >  T.  and  for 
..nv  i.  k.  and and  therefore  d,(i,  k)  >  d*{i.  k ).  also. 

Condition  5:  Consider  any  dT{  i.  k ).  It  is  assumed  that 
iode  i  has  sent  or  will  send  this  distance  to  each  node  that 
'  a  neighbor  of  node  i  in  the  final  topology.  Furthermore, 
i  the  final  transmission  of  this  distance  by  node  i  is 
cceived  by  any  neighbor ;  prior  to  time  T.  then  dT(  j,  k:  i ) 
I  v<  ;•  i)-1-  dr(  i.  k ). 

Then,  there  exists  a  time  T'  after  which  each  entry  in  the 
iisiance  cube.  i.e..  each  current  shortest  distance  and  each 
nstance  assessment,  is  optimal  with  respect  to  the  final 
ipology. 


Thus, 

dT(mp-],k)~d*(mp_l,k).  (4) 

By  (2),  (4),  and  the  optimality  of  (3),  there  exists  a  largest 
index  /,  /<  p  —  1,  such  that  dT(m:,k)>  dj(m,,k).  Since  / 
is  maximal,  dT(  m,+ , )  =  d*{ m/+ , ,  k ).  The  suboptimality  of 
dT(mh  k)  and  Condition  5  imply  that  node  m,  will  receive 
the  distance  k)  from  node  m(+l  at  some  time 

7 ]  >  T,  at  which 

df(ml,k)<dTi(mi,k)*zdT(ml,k;  ml+l) 

=  sf(m,,  m,+  ,)+  dj(m,+  i,  k)  =  dj(m,,  k). 
i.e.. 

dTi(ml,k)  =  dJ(ml,k). 

In  other  words,  the  optimality  of  dT(ml+]>k)  implies  the 
optimality  of  dT(m^k)  at  some  time  Tt>T,  and  /  — 1 
becomes  the  largest  index  such  that  node  ml_]  has  yet  to 
find  the  optimal  distance  to  node  k.  By  induction,  there 
exists  a  time  70  >  7  such  that 

dr(m0.k)  =  d?(m0,k). 

Let  tlk  =  70;  then 

dji.k)  =  df{i,k).  (5) 

Lemma  2:  For  any  d(  j.  k\  i ).  there  exists  a  time  tJkl  >  T 
such  that 


d,Jj,k-,i)  =  df(jA\i). 


(6) 
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Proof:  If  dT(j,k,i)  =  d*(j,k;i),  t her.  (6)  is  true  for 
tjkl  =  T.  Otherwise,  the  suboptimality  of  dT(  y,  k:  i )  and 
Condition  2  imply  that  the  shortest  distance  from  j  to  k  via 
/  is  not  direct,  i.e.,  k  =*=  /,  and 

d*  ( J .  k ;  / )  =  sf  ( j ,  i )  +  dj  ( / ,  k ) . 

By  (5).  and  by  Condition  5  along  with  the  fact  that  current 
shortest  distances  must  be  transmitted  whenever  they 
change  value,  node  j  must  receive  distance  d*( i,  k )  from 
node  i  at  some  time  r .  ,  yielding 

d,Jj,k:i)  =  d?(j,k:t). 

Lemma  3:  After  time  T,  once  a  current  shortest  distance 
or  distance  assessment  becomes  optimal,  it  remains  opti¬ 
mal. 

Proof:  A  current  shortest  distance  cannot  increase  in 
the  static  algorithm.  Condition  4  states  that  such  a  distance 
cannot  be  less  than  the  optimum.  Therefore,  once  optimal¬ 
ity  is  attained  a  current  shortest  distance  can  neither 
increase  nor  decrease.  A  distance  assessment  can  change 
only  if  the  length  of  the  link  to  the  corresponding  next 
node  changes  or  new  current  shortest  distance  is  received 
from  that  next  node.  Since  the  topology  is  assumed  fixed 
after  T,  and  current  shortest  distances  cannot  change  after 
reaching  optimality,  optimal  distance  assessments  also  re¬ 
main  optimal. 

We  can  now  complete  the  proof  of  Theorem  1.  By 
Lemmas  1-3,  given  any  current  shortest  distance  or  dis¬ 
tance  assessment,  there  exists  a  time  after  which  that 
particular  value  is  guaranteed  to  be  optimal.  Let 

r*=  max 

i.j.k 

Then  Theorem  1  is  true  for  time  7~*. 

Convergence  of  the  Dynamic  Algorithm 

Suppose  that  the  dynamic  algorithm  discussed  in  Section 
III  and  detailed  in  Fig.  5  is  used  in  a  network,  and  that  the 
assumptions  preceding  the  description  of  the  algorithm  in 
that  section  hold  for  each  of  the  lemmas,  corollaries,  and 
theorems  that  follow. 

Lemma  4:  There  exists  a  time  t  such  that 

d,(i,k\  j)> df(i,  k)  (7) 

for  all  i  >  f,  and  for  all  i,  k,  and  j. 

Proof:  Consider  the  topology  of  the  network  at  time 
cf.  Since  changes  are  not  allowed  after  this  time,  the 
topology  of  the  network  is  fixed  after  if,  it  consists  of  one 
or  more  maximal  connected  subnetworks,  which  we  refer 
to  as  components.  Consider  an  arbitrary  component. 

Case  A:  Suppose  no  node  of  the  component  receives  or 
generates  a  local  reinitialization  between  r'  and  t Then 
the  dynamic  algorithm  is  equivalent  to  the  static  algorithm 
with  respect  to  this  component.  The  only  type  of  topologi¬ 
cal  change  that  can  occur  is  link  length  decrease.  Since  (7) 
is  assumed  true  for  r  =  r‘  and  since  the  right-hand  side  of 


(7)  can  decrease  but  not  increase,  it  can  be  proven  [11]  that 
(7)  must  hold  for  all  i  >  t'. 

Case  B:  Otherwise,  at  least  one  local  reinitialization  is 
received  or  generated  by  a  node  in  the  component.  If  it  is 
generated  outside  the  component,  then  the  initiator  has 
become  separated  from  the  component  and  a  local  reini¬ 
tialization  would  be  generated  due  to  the  separation,  so  at 
least  one  local  reinitialization  originated  within  the  compo¬ 
nent  between  t'  and  tf.  Consider  the  final  local  reinitializa¬ 
tion  generated  by  a  component  node  (ties  can  be  broken 
arbitrarily),  and  say  this  occurs  at  time  rllit.  No  increases, 
failures,  or  activation  of  new  links  can  occur  after  rlasI,  the 
component  is  guaranteed  to  be  connected  after  this  time, 
and  each  node  of  the  component  must  receive  this  final 
local  reinitialization  by  some  time  tc. 

Let  i  be  a  node  of  the  component,  let  /  s*  rf.  and  consider 
any  d,(i.k:  If  d,(i,k;  y,)  =  oc.  then  clearly  (7)  is  satis¬ 

fied.  Suppose,  instead,  that  d,(i.k :  y,)<».  If  y,  =  k.  then 

d,(i.  k\  y, )  =  d,{i.  k,  k)  —  s,(  i .  k)>d*(i.k). 

But  if y,  =*=  k.  then 

d,{t.k:  y,)  =  s,(i,  y, )  d,fjx,k), 

where  r>rl>rl4Jl,  because  for  the  distance  from  y,  to  be 
accepted  by  /.  node  y,  must  have  received  the  final  local 
reinitialization  and  sent  it  to  node  i.  Now. 

dt,(  j\<k)  =  d,,(y, ,  k;  j2) 

for  some  node j2,  which  again  is  either  a  direct  distance  if 
j2~  k.  or  otherwise 

d,,(  J\-k\  j2 )  =  slt(  y,  ,j2)~  d,f  j2.k),  ^ 

where  for  the  same  reason  as  before,  t,  >  35  rlwI.  Con¬ 

tinuing  in  this  manner,  we  see  that 

d,(i.k-,j[)=s,(i.jl)  +  su(jl,j2) 

+  Js)~  ■  ■  ■  ~ ;w.  k  i. 

where  t  >  r,  >  r;  >  •  •  •  >  tm  » rlasl.  Since  only  decreases  in 
link  length,  and  no  other  topological  changes,  can  occur 
after  rlj5l,  letting  k  =  jm _ ,  we  have 

for  all  /.  which  implies  that 

d,(i'k;  j,)>s,(i.  j,)  +  st(j',.  j2) 

+  *,(hJi)+  ■■■  ~s<(jm-k) 
>df(Lk). 

Thus,  for  each  component,  a  time  /  can  be  found  for 
which  Lemma  4  is  satisfied  within  the  component.  Maxi¬ 
mizing  tc  over  all  components  yields  the  desired  t  for  the 
entire  network. 

Lemma  5:  No  particular  local  reinitialization  message 
can  cause  any  node  to  reinitialize  more  than  once. 

Proof:  The  proof  is  by  contradiction.  Suppose  that 
some  local  reinitialization,  call  it  LR 1.  does  cause  a  node  to 
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reinitialize  twice.  Then,  of  all  nodes  tha.  reinitialize  twice 

Ilue  to  LR 1.  let  node  i  be  the  first  that  reinitializes  a  second 
ime.  and  say  that  this  occurs  at  time  (Ties  are  broken 
arbitrarily.)  This  action  by  node  i  can  only  be  caused  by 
'he  receipt  of  LR\  from  some  neighbor,  sav  j.  Since  node  j 
Is  assumed  not  to  have  reinitialized  twice  due  to  LR\ 
oefore  tr  node  j  must  be  transmitting  LR\  for  the  first 
ume.  Thus,  when  node  i  first  received  LR\,  say  at  ume 

|f,rs!.  it  could  not  have  been  sent  by  node  j. 

Case  A:  Suppose  nodes  i  and  j  were  neighbors  at  time 
/firM  and  remained  neighbors  at  least  until  time  r,.  Receipt 

Iif  LR\  by  node  i  caused  an  embargo  to  be  placed  on  node 
The  LR\  message  received  from  j  by  i  at  being  the  first 
such  transmission  from  node  j.  causes  the  removal  of  the 
j'mbargo,  but  not  the  reinitialization  by  i  that  was  assumed. 
;  iy  contradiction,  nodes  i  and  j  were  not  neighbors  at  /firsl. 

Case  B:  If  /  and  j  were  not  neighbors  dunng  the  entire 
nenod  from  rfir5t  to  r,,  then  i  and  j  became  neighbors  at 


ome  time  r„ 


It.  where 


^ first  <  rn 


,  +  A  t<t,. 


(8) 


.ince  no  node  reinitializes  twice  before  the  transmission 
of  LR  1  by  node  j  must  be  the  result  of  a  sequence  of  nodes, 
beginning  with  the  initiator  of  LR\.  each  transmitting  LR\ 
or  the  first  time.  This  path  from  the  initiator  of  LR]  to 
ode  j  must  be  loopless.  If  tLRl  is  the  time  at  which  LR\ 
was  generated,  then  by  the  definition  of  A/, 

ll.RI  **  ri  **  lLR\  _r  A t ■  (9) 

Also. 

iLR\  **  rtinl  ^  lLR\  +  Af-  (10) 

Then,  from  (8)-(  10).  we  get 

<  t,  ~  Ar  <  tLR]  <  ifirsl  <  fn„  +  Ar.  (11) 

in  other  words.  rfinI£  [r^.  r^  ~  Ar ),  so  that  the  index 
orresponding  to  LR  1  is  stored  by  node  i  until  time  rnew  + 
Ar.  as  specified  by  the  mechanism  for  adding  links.  Also, 
'rom  ( 1 1 )  we  get 


-2A  /. 


links  connecting  them  remains  active,  the  neighbor  will 
receive  and  return  the  local  reinitialization  message,  or  will 
send  this  message  to  the  node  as  a  result  of  receiving  the 
message  from  one  of  its  other  neighbors,  thus  removing 
the  embargo.  If  the  pair  of  links  fail,  the  node  removes  the 
neighbor  from  its  set  of  neighbors,  eliminating  the  em¬ 
bargo.  By  Lemma  5,  the  removal  of  an  embargo  is  perma¬ 
nent. 

Lemma  7:  There  exists  a  time  tE.  after  which  there  are 
no  embargoes  anywhere  in  the  network. 

Proof:  By  Lemma  6.  each  embargo  has  a  permanent 
removal  time.  There  are  a  finite  number  of  topological 
changes  that  occur  between  t‘  and  tf.  thus  a  finite  number 
of  embargoes  placed.  Maximizing  the  removal  times  over 
all  embargoes  yields  the  desired  tE. 

Lemma  8:  Let  T=  max  [tf,  t,  tE).  Then  after  time  T. 
the  dynamic  algorithm  is  equivalent  to  the  static  algorithm. 

Proof:  No  local  reinitializations  can  be  generated  after 
tf.  Any  embargoes  that  were  placed  have  been  removed  by 
time  te.  The  only  operations  performed  after  time  T  are 
the  addition-comparison  operations  and  subsequent 
transmission  of  better  shortest  distances,  as  in  the  static 
algorithm. 

Theorem  2:  Under  the  given  conditions,  there  exists  a 
time  T  after  which  each  entry  in  the  distance  cube,  i.e.. 
each  current  shortest  distance  and  each  distance  assess¬ 
ment,  is  optimal  with  respect  to  the  final  topology. 

Proof:  We  individually  prove  that  the  conditions 
specified  in  Theorem  1  are  satisfied. 

Condition  I:  T>tf  implies  that  no  topological  changes 
occur  in  the  network  after  time  T. 

Condition  2:  By  assumption,  a  node  always  knows  the 
distance  to  any  neighbor,  and  uses  that  distance  for  the 
distance  assessment  via  that  neighbor. 

Condition  3:  Consider  any  d,(i.  k\  j ).  where  t>T.  If 
node  i  received  no  local  reinitializations  between  t'  and  t,. 
then  the  only  type  of  topological  change  that  can  occur  is 
link  length  decrease.  If  node  i  constructed  d,(i.  k\  j)  based 
on  a  distance  sent  by  node  j  at  time  r;  <  t.  then 

d,(i.k\  j)=s,(i.  j)+  d,(j.k) 

>s,{i.  j)-rd,(j.  k) 


•"  that  at  when  i  receives  LR\  from  j.  the  index 
orresponding  to  LR  1  will  be  found  in  the  set  of  stored 
ndexes.  and  node  i  will  not  reinitialize  at  r,,  again,  a 
ontradiction.  Therefore,  no  local  reinitialization  message 
in  cause  any  node  to  reinitialize  more  than  once. 

Corollan •  ! :  Once  a  particular  embargo  is  removed  from 
neighbor  by  some  node,  the  node  can  never  place  another 
nbargo  with  the  same  index  on  that  neighbor. 

Proof:  An  embargo  can  be  replaced  only  if  a  second 
.initialization  due  to  the  same  local  reinitialization  mes- 
ge  occurs  which,  by  Lemma  5.  cannot  happen. 

Lemma  6.  Every  embargo  that  is  placed  is  eventually 
moved. 

Proof  Suppose  a  node  embargoes  distance  mforma- 
n  from  some  neighbor.  The  corresponding  local  reini- 
ilization  must  then  be  sent  to  that  neighbor.  If  the  pair  of 


because  current  shortest  distances  cannot  increase  in  the 
static  algorithm.  If  d,(i,k ;  j)  is  not  constructed  from  in¬ 
formation  received  from  j.  then  its  value  differs  from 
d,  (i.k ;  j)  only  if  one  or  more  decreases  in  link  length 
r(  i.  j )  occurred  between  t '  and  t.  That  is. 

d,(t.k\  j)  =  d,\  i.  k;  j)  —  [  j,.(  i,  j  )  -  s,(  i,  j  )] 

s*j,.((,  j)+d,.(i.  j)-s,  (t ,  j)  +  s,U.  j  ) 
>s,(i.  j)  +  d,{j,k). 

If  a  reinitialization  was  received,  then  d,u.  k\  j)  is  either 
infinite  or  is  constructed  from  information  sent  by  j.  in 
which  case,  again 

d,(i.k\  j)>  r,(i.  ;  )-*-  dft.k). 


( Continued  )  (c)  Activating  new  links  and  (d)  detecting  a  branch  length  decrease 
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Condition  4:  Proof  by  Lemma  4. 

Condition  5:  Consider  any  dT(i,k).  If  this  value  is  at¬ 
tained  at  time  T ,  then  node  i  must  transmit  this  distance  to 
each  of  its  neighbors.  If  the  value  is  attained  prior  to  7". 
then  d(i,  k)  has  remained  constant  since  some  time  before 
T.  Node  i  was  required  to  send  the  distance  to  each  of  its 
neighbors  at  that  time,  and  may  have  been  required  to 
make  subsequent  transmissions  of  dT(i,k)  due  to  local 
reinitializations.  Let  t,  be  the  largest  at  which  node  i 
was  required  to  transmit  dT(i ,  k). 

At  t,,  node  i  transmits  to  each  node  that  is  its  neighbor 
at  that  time.  If  any  neighbor  of  /  at  time  T  was  not  a 
neighbor  of  i  at  t,,  then  connecting  links  were  activated 
between  t,  and  T,  causing  node  i  to  reinitialize,  and  requir¬ 
ing  that  node  /  transmit  dT(i,k)  after  the  assumed  final 
time  of  t,  .  By  contradiction,  the  neighbors  of  /  at  time  t,  are 
identical  to  the  neighbors  of  i  in  the  final  topology. 

Suppose  one  of  these  neighbors  of  /,  say  node  j,  rejects 
the  final  transmission  of  dT(i,k)  due  to  an  embargo  on 
node  /.  Since  embargoes  must  all  be  removed  by  T,  node  j 
must  receive  a  local  reinitialization  message,  accompanied 
by  distance  dT(i,  k),  which  was  sent  by  node  i  after  the 
assumed  final  time  t,,  again  a  contradiction.  So  any  node  j 
receiving  the  final  transmission  of  dT(i,k)  before  time  T 
must  accept  the  value,  and  have 

dT(  j,k\i)  =  sT(j,  i)  +  dT(i,k) 

=  sf(j,i)+dT(i.k). 
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as  a  backward  equation,  so  that 


B  I)  O  ANIil  RSON  AND  I  0  KIIODIS 
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and  (he  corresponding  backward  equation  is  1  Accordingly,  wilh  i\u  and  />,„  denoling  inmormali/cd  densilics. 
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Now,  (A  I)  ami  (A .2)  arc  both  forward  equations  to  which  the  forward 
filtering  equation  can  be  applied  Thus,  see  (4  8) 
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SUMMARY 

The  product  of  the  observability  and  reachability  gramians  of  a  linear  sys¬ 
tem,  denoted  MW,  is  a  matrix  whose  eigenvalues  are  non-negative  and  are  invari¬ 
ant  under  coordinate  transformations  in  the  state  space  [4],  The  second-order 
modes  of  the  system—  the  square  roots  of  these  eigenvalues—  can  be  interpreted 
as  measures  of  the  mean-square  energy  throughput  in  appropriately  defined 
channels  [3,6  and  others].  The  sum  of  the  eigenvalues  also  arises  in  a  natural 
way  as  a  continuous-valued  measure  of  interaction  for  purposes  of  decoupling, 
or  nearly  decoupling,  a  system  with  several  input  and  output  channels:  define 
measures  of  reachability  and  observability,  respectively,  on  the  state  space  by 
R(x)=%xT  Wx  and  0(x)  =  %zT  Mx  [2].  Then  let  a  measure  of  interaction  between 
the  system  inputs  and  outputs  be  defined  as 

where  /?*(x)  denotes  the  conjugate  functional  J£x  T  W  _1z ,  and  where  the  expecta¬ 
tion  is  with  respect  to  the  uniform  density  on  \R‘(x)=  lj.  It  is  readily  seen  (as 
in  [7])  that  I  =  1/n  tr  MW.  For  convenience,  we  shall  use  the  measure  tr  MW. 

We  present  results  for  near-disturbance  localisation  in  discrete-time  sys¬ 
tems  based  on  satisfaction  of  a  first-order  necessary  condition.  Consider  a  sys¬ 
tem 

xtfl  =  Axk  +  B  ufc  -H  Dvk,  (la) 

Vk  =  Cxk  (lb) 

and  consider  u as  a  local  control  input,  a  disturbance.  Define  the  problem  of 
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near-disturbance  localization  by  state  feedback  as:  find  F  -  arg  min  tr  My  Way. 
where  ()y  signifies  that  the  system  matrix  is  (A+BF),  and  where  Wpy  denotes 

e* 

the  reachability  matrix  of  the  pair  ((A+BF),D).  Consider  the  case  in  which  Ay  is 
stable.  For  the  infinite -horizon  problem  in  which  My,  Way  are  steady-state 
gramians,  the  gradient  matrix  of  tr  My  Way  with  respect  to  the  feedback  gain 
matrix  F  has  a  concise  closed-form  expression: 

~r  tr  My  W0  y  -  2  BT  (  My  Ay  y  +  A  y  Ay  Wo.y  ) 
where  F  and  A  are  weighted  graTnians  defined  by  the  steady-state  discrete-time 
Lyapunov  equations 

?D.F  -  Ay  ?D.y  AyT  +  WD  y 
Ay  -  AyT  Ay  Ay  +  My 

We  wish  to  find  F  such  that  mprtr  MyWD  y=  0.  If  there  are  at  least  as  many  con- 

Or 

trol  inputs  as  outputs,  with  B  having  full  (column)  rank,  this  condition  has  a 
solution  that  is  independent  of  D.  Let  p  be  the  number  of  outputs.  Consider 
first  the  case  in  which  the  first  nonzero  Markov  parameter  of  the  local  system 
has  full  rank:  p(CB)  =  •  •  •  =  p(CA6~zB)  =0  and  p(CAi~'B)  =  p.  (This  is  always  true 
of  single-output  systems.)  Then  it  is  easily  seen  that  F’  =  —(CAs'lB)¥  CAS,  where 
( CAs~lB )f  is  any  right  inverse  for  CAS~'B.  This  F‘  has  the  property  that 
F*S.F(  V*),  and  the  map  induced  by  {A+BF")  on  X/V*  has  all  its  eigenvalues  at 
the  origin;  this  property  is  a  characterization  of  F *  for  single-input,  single¬ 
output  systems.  Furthermore,  if  the  system  is  minimum-phase,  F‘  is  the  same 
as  the  optimal  F  obtained  by  the  "cheap  control”  technique  of  finding 

F°  =  lim  Fc° , 

£-*0 

where 

=  Ft°  xfc 

£xfcrCrCxt  +  tukTuk\ 


=  arg  min 
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This  is  equivalent  to  finding  F°  =  arg  min.  tr  C  WD  F  CT  in  the  case  of  a 
minimum-phase  system;  the  gradient  of  this  latter  interaction  measure  is  given 
by 

fjrtrC  WDiF  CT  =  2Br  (  MF  AF  WD  F  ). 

F‘  =  —  (CAi~lB)*  CAe  satisfies  both  first-order  necessary  conditions;  in  fact,  it 
satisfies  the  stronger  condition 

(C(A  +  BF')k~'  B)7  (C(A+BF‘)k)  =  0.  A:  =  1.2. .  (2) 

which  ensures  that  both  gradients  vanish  term-by-term. 

In  the  nonuniform  rank  case  in  which  m  >p>l  and  CB  =  ...=C/4*'ZB  =0, 
^0  and  p(CAs~l B)<  p,  one  plausible  extension  of  the  above  results  is  that 
the  optimal  F*  be  characterized  by  the  condition  of  eqn.  (2),  and  be  described 
by  the  property  that  l)  F’^.F{V")\  2)  the  map  induced  by  (A+BF‘)  on  X/ V*  has 
all  eigenvalues  at  the  origin;  and  3)  F'  =  ~(BT  P  B)~l  BT  P  A,  where  P  is  a  (not 
necessarily  stabilizing)  solution  to  the  Riccati  equation  P  -  A7  P  A  +  CT  C 
-A7  P  B(B7  P  B)~l  B7  P  A.  The  simulations  performed  so  far  support  this  con¬ 
jecture. 
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In  this  Section.  necessary  and  sufficient  conditions  for  uniqueness  of  the  minimal  cover  for 
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Abstract — A  mew  proof  is  prreaatad  lor  the  Inequality  tr  (XY)  3 
HXh-tr  Y.  This  argamm  is  valid  uder  tke  condition  that  Y  be  real 
fyauMtrie  aounegaltve  definite;  X  may  be  may  square  matrix. 

INTRODUCTION 

Much  work  has  been  done  in  recent  years  to  establish  bounds  on  the 
eigenvalues,  snd,  in  particular,  on  the  spectral  norm,  of  solutions  to  the 
algebraic  matrix  Riccad  equations  and  the  Lyapunov  equations  of  control 
and  estimation  theory.  The  derivations  in.  e.g..  [2]-[5]  have  either  used 
or  implied  the  fact  that,  for  X,  Y  €  R"“"  with  both  X,  Y  &  0,' 

trumsp-ll,  •  tr(n  (t) 

where,  following  [1],  K  - 1|2  denotes  the  spectral  norm  or  largest  singular 
value.  In  [2]— [61,  this  inequality  was  only  applied  to  matrices  X,  Y  that 
were  guaranteed  to  be  symmetric  and  either  nonnegative  definite  or 
positive  definite.  A  more  recent  work  on  eigenvalue  bounds  [7]  contains 
the  related  result  for  Y  »  Y'  3c  0  and  X  *  X' 

KUX)  trmstr  (ADsU-n  tr  ( JO 

which  unplies  (1).  However,  (1)  holds  whenever  at  least  the  matrix  Y  is 
symmetric  and  nonnegative  definite;  the  other  can  be  an  arbitrary  real- 
valued  square  matrix.  This  more  general  result  has  not,  to  the  authors' 
knowledge,  previously  appeared  in  the  control  literature. 

Results 

Theorem:  Let  X,  Y  €  Rm*m  with  Y  symmetric  and  nonnegative 
definite.  Then 

tr  (Jfy)s|tr(AT)|S  11^11,  •  tr(T).  (2) 

Maasrnpi  mowed  August  21.  1986;  revised  Much  16.  1987. 
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1  16)  man  both  X,  Y  >  0. 


(tr  (AT) |  s  2  k/*«/l 

I l«l  ■  \\XQ> II 

>•» 

*2  \X\i  ■  |q;P-«Jrili  •  £  l|?JJ  (4) 

i-i  /-i 

by  the  Cauchy-Schwtrz  inequality  and  the  definition  of  the  spectral  norm. 
But 


±  lk,ll’«£  £  kt/lJ 

)- I  '-I 

*  II  Cl1,; 

and  if  we  recall  that  ||2|J.-tr  Q'Q  -  tr  QQ‘ ,  (4)  becomes 
Itr  (XY) |  s  |  Xh  ■  tr  QQ'  -  || X Jj  •  tr  Y. 
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ABSTRACT 


Problems  of  decision  making  in  the  face  of  statistical 
uncertainties  have  long  been  of  interest  to  decision  theorists  in 
various  disciplines.  A  decentralized  detection  or  hypothesis¬ 
testing  system  is  a  special  case  of  a  decentralized  decision¬ 
making  system  in  which  several  local  agents  or  detectors 
observe  a  common  region  of  the  environment  to  determine, 
for  example,  the  presence  or  absence  of  a  certain 
phenomenon.  Due  to  such  factors  as  communication  con¬ 
straints,  local  detectors  must  make  decisions  at  their  local  sites 
and  send  their  decisions,  rather  than  their  received  measure¬ 
ments  or  observations,  to  a  central  processor,  who  is  responsi¬ 
ble  for  declaring  the  final  decision.  Moreover,  'he  local  detec¬ 
tors  are  not  permitted  to  communicate  with  one  another.  In 
this  paper,  we  study  a  broad  class  of  decentralized  multi-stage, 
multi-detector  binary  hypothesis-testing  problems.  It  is  shown 
that,  under  appropriate  independence  assumptions  on  the 
received  measurements,  local  strategies  at  each  time  instant 
are  given  by  threshold  tests  on  the  likelihood  ratio.  Further¬ 
more,  it  is  shown  that  local  decisions  of  each  detector  depend 
not  only  on  his  present  and  past  observations,  but  on  his  past 
decisions  as  well.  That  is,  for  each  local  detector  there  is  a 
different  threshold  corresponding  to  each  combination  of  past 
decisions. 


L  INTRODUCTION 

Problems  of  decision  making  in  the  face  of  statistical 
uncertainties  have  long  been  of  interest  to  decision  theorists  in 
various  disciplines.  A  great  deal  of  attention  has  been  given 
to  these  problems  in  order  to  cope  with  uncertainty.  Detection 
theory  is  one  area  of  decision  making  that  has  received  much 
attention,  particularly  in  surveillance  systems.  The  well- 
known  theory  of  classical  detection  (e.g.,  (Van  Trees,  1968, 
1971])  has  been  motivated  by  single-sensor  detection  prob¬ 
lems.  The  situation  is  quite  different  when  physically 
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distributed  sensors  obtain  measurements  from  a  common 
region  of  the  environment  in  order  to  determine  the  presence 
or  absence  of  a  certain  phenomenon. 

Even  though  classical  detection  theory  is  equally  applica¬ 
ble  to  such  multi-sensor  detection  problems,  in  practice  there 
may  be  a  need  to  communicate  received  raw  data  from  geo¬ 
graphically  dispersed  locations  to  a  central  processing  unit. 
Such  communication  capability  might  not  be  feasible  for  a 
number  of  reasons  that  we  will  discuss  shortly.  In  such  a  case, 
each  of  the  local  processors  may  make  a  local  decision  and 
send  this  decision,  rather  than  the  measured  data,  to  the  super¬ 
visor  or  fusion  center  where  these  local  decisions  are  used  to 
make  an  overall  final  decision.  This  problem  is  sometimes 
referred  to  as  a  decentralized  (or  distributed)  detection  prob¬ 
lem  since  central  processing  of  all  measurements  is  not  avail¬ 
able. 

Distributed  detection  is  gaining  increasing  practical 
significance,  especially  in  surveillance  systems.  Although  the 
performance  of  decentralized  detection  is  suboptimal  com¬ 
pared  to  centralized  detection,  it  is  in  many  cases  a  more  real¬ 
istic  formulation  in  practical  applications  than  its  centralized 
counterpart.  For  example,  as  discussed  above,  in  the  face  of 
capacity-constrained  channels,  local  processing  could  substan¬ 
tially  decrease  the  communication  bandwidth  to  transmit  raw 
data  to  a  central  site,  thereby  speeding  up  the  process  and 
reducing  the  transmission  cost.  Also,  decentralization  may  be 
the  natural  way  to  model  the  problem  in  situations  where  there 
are  multiple  detectors  of  various  nature  or  located  at  dispersed 
geographical  sites.  Furthermore,  decentralized  detection  may 
just  be  imposed  in  situations  where,  due  to  enormous  amounts 
of  available  raw  data,  centralized  processing  of  the  informa¬ 
tion  is  not  feasible.  Other  issues  include  potential  system  reli¬ 
ability  and  integrity  in  the  face  of  failures. 

The  problem  of  decentralized  decision  making  can  also  be 
viewed  in  the  framework  of  decentralized  optimal  control 
theory  [Sandell,  '/araiya,  Athans,  &  Safonov.  1978],  [Tenney 
&  Sandell,  1981a,  1981b,  1981c].  However,  unlike  most 
decentralized  control  problems,  the  decentralized  hypothesis¬ 
testing  problem  can  be  tackled  in  a  relatively  straightforward 
manner  leading  in  many  cases  to  explicit  decision  rules.  This 
inherent  simplicity  is  due  to  the  absence  of  any  feedback  in 
the  system  [Tenney  &  Sandell,  1981a],  (Tenney,  1982],  so 
that  decisions  made  by  one  local  processor  do  not  alter  the 
system  dynamics  and  have  no  influence  on  the  information 
available  to  other  processors. 

From  another  standpoint,  decentralized  hypothesis  testing 
can  be  viewed  in  the  framework  of  team  theory  [Radner, 
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1962],  [Ho  &  Chu,  1972],  [Ho,  1980);  a  “team”  of  local  pro¬ 
cessors  take  observations  regarding  a  certain  phenomenon, 
each  of  which  must  make  an  independent  decision  based  on 
his  own  information  and  pass  this  decision  on  to  a  central  pro¬ 
cessor,  who  is  also  a  member  of  the  team. 

There  are  a  variety  of  ways  in  which  one  can  pose  a 
decentralized  detection  problem.  Each  formulation  has  ele¬ 
ments  that  capture  one  or  all  of  the  following  three  features  of 
decentralized  detection  and  decision-making  problems: 

(a)  Hierarchical  Structure:  Local  observations  are  pro¬ 
cessed  and  the  result  sent  to  a  fusion  center  where  a  final 
or  “global”  decision  is  made.  There  may  or  may  not  be  a 
direct  communicadon  between  the  local  observers. 

(b)  Multi-Stage  or  Sequential  Structure:  Addidonal  meas¬ 
urements  can  be  taken  by  the  local  observers.  Each  local 
observer  might  wait  until  he  elects  to  stop  taking  measure¬ 
ments  before  sending  information  to  the  fusion  center,  or 
information  might  be  sent  after  each  observation  and  the 
fusion  center  given  the  flexibility  to  determine  when  to 
declare  a  decision. 

(c)  Information  Rate  or  Bandwidth  Reduction:  The 
preprocessing  performed  by  each  local  observer  should 
result  in  a  significant  reduction  in  the  amount  of  data  sent 
to  the  fusion  center.  Each  local  processor  may,  for  exam¬ 
ple,  make  a  decision  at  his  local  site  and  send  his  decision 
to  the  fusion  center  instead  of,  say,  his  observation  (as  in 
centralized  detection)  or  even  some  sufficient  statistics. 
The  object  of  this  paper  is  to  analyze  a  broad  class  of 

multi-stage  distributed  detection  problems  that  extends  the 
results  of  [Hashemi  &  Rhodes,  1987],  We  are  interested  in 
limited  communications  due  to  such  factors  as  limited  channel 
capacity,  transmission  cost,  etc.  In  other  words,  we  are  deal¬ 
ing  with  communication-constrained  problems.  In  Sections 
III  and  IV,  we  shall  thoroughly  investigate  a  multi-stage, 
multi-person  decentralized  binary  hypothesis-testing  problem 
in  which  each  detector,  after  obtaining  each  of  his  observa¬ 
tions,  sends  a  binary  decision  (0  or  1 )  to  the  fusion  center  or 
the  “supervisor,”  who  is  given  the  option  of  declaring  a  final 
decision  (i.e.,  deciding  which  of  two  possible  hypotheses,  H0 
or  //j,  is  true)  at  either  stage  depending  on  the  local  decisions 
that  have  been  received.  There  is  a  cost  associated  with  each 
combination  of  the  true  state  and  the  final  decision.  In  addi¬ 
tion,  a  cost  is  incurred  for  delaying  the  final  decision  until  the 
next  time  instant. 


It  is  shown  that,  under  appropriate  independence  assump¬ 
tions,  the  optimal  local  strategies  are  governed  by  threshold 
tests  on  the  likelihood  ratio  (i.e.,  likelihood-ratio  tests),  where 
the  thresholds  are  off-line  computable.  This  is  the  major  con¬ 
tribution  of  this  paper.  The  importance  of  this  result  is 
apparent  when  one  recalls  that  decentralized  detection  prob¬ 
lems  are  in  general  NP-complete  [Tsitsiklis  &  Athans,  1985], 
In  other  words,  these  independence  assumptions  make  tract¬ 
able  an  otherwise  intractable  problem. 

ie=0 

The  notation  “/( Jt)  £  t"  is  the  standard  decision  theory 


notation  meaning  that  u= 0  when  u=l  when  f(x)c , 

and  u=0  or  1  when  For  notational  convenience,  let 

Ex^  t/fx )]  —  E[/(z)ly]  denote  the  conditional  expectation  of 
fix)  given  y.  For  convenience,  we  shall  use  the  notation 
pi x\y)  for  PxiyOtly);  i.e.,  the  arguments  are  used  to  denote  dif¬ 
ferent  functions.  It  is  also  assumed,  for  convenience  and  to 


simplify  the  development,  that  all  conditional  densities  exist. 
Wf  will  assume  that  the  set  of  states  of  nature  contains  two 
elements,  called  hypothesis  H 0  and  hypothesis  H , ,  and  that  H 
refers  to  H0  or  H\.  The  symbol  “J”  is  used  to  denote  the  cost 
function,  and  X  to  denote  the  likelihood-ratio  function, 
p{HrSX)  “=« 

X(;0  =  — —  —  .  The  test  X(X)  £  t  is  called  a  likelihood- 
p(H,IX)  «=t 

ratio  test  and  the  constant  t  is  called  a  threshold. 

Briefly,  the  rest  of  the  paper  is  organized  as  follows.  In 
Section  IL  we  briefly  survey  the  existing  decentralized 
decision-making  algorithms  in  the  literature.  Section  m  intro¬ 
duces  the  problem  statement  and  furnishes  some  preliminary 
development.  The  solution  to  the  general  decentralized  detec¬ 
tion  problem  is  provided  in  Section  IV.  Finally,  Section  V 
presents  some  concluding  remarks. 


IL  PRELIMINARIES 

Several  classes  of  decentralized  detection  problems  have 
been  studied  over  the  past  few  years.  A  one-stage  decentral¬ 
ized  hypothesis-testing  problem  has  been  examined  in  [Ten¬ 
ney  &  Sandell,  1981a],  in  which  each  of  the  two  local  detec¬ 
tors  takes  a  single  observation,  makes  a  local  decision  as  to 
which  of  the  two  hypotheses  is  true  and  sends  this  decision  to 
a  “fusion  center”  when  an  overall  decision  is  made.  There  is 
no  communication  between  the  local  detectors  and  their  obser¬ 
vations  are  statistically  independent.  The  pcrson-by-person 
optimal  local  strategies  shown  to  be  given  by  threshold  tests 
on  the  likelihood  ratio.  Some  extensions  of  the  above  prob¬ 
lem  have  also  been  reported  in  [Lauer  &  Sandell,  1982a]  and 
[Lauer  &  Sandell,  1982b].  In  [Ekchian  &  Tenney,  1982]  a 
one-step  team  decision  problem  has  been  considered  where 
communications  are  allowed  among  the  decision  agents  under 
a  prearranged  causal  ordering.  The  decision  rules  are  likeli¬ 
hood  ratios  based  on  the  data,  with  thresholds  determined  by 
incoming  communicated  messages. 

The  above  problems  are  restricted  to  only  one  time  step. 
Several  authors  have  considered  decentralized  multi-step 
problems.  In  [Kushner  &  Pacut,  1982],  the  information  com¬ 
municated  by  each  of  the  two  local  detectors  to  the  fusion 
center  is  a  conditional  probability;  this  serves  as  a  sufficient 
statistic,  so  that  from  the  coordinator’s  point  of  view  the  prob¬ 
lem  is  a  centralized  one  and  his  optimal  strategy  is  a  likeli¬ 
hood  ratio  test.  Since  transmitted  conditional  probabilities  are 
real  numbers,  this  scheme  does  not  substantially  reduce  the 
communication  bandwidth  unless  the  number  of  observations 
made  by  each  local  detector  is  large. 

In  [Teneketzis,  1982],  [Teneketzis  &  Ho,  1985]  a  decen¬ 
tralized  extension  of  the  classical  Wald  problem  [Wald,  1947], 
has  been  studied  and  solved  via  dynamic  programming.  At 
every  time  instant,  each  of  the  two  detectors  decides  whether 
to  stop  and  transmit  to  the  fusion  center  his  decision  as  to 
which  of  the  two  hypotheses  is  true,  or  to  continue  until  the 
next  time  step.  A  cost  is  incurred  for  taking  each  additional 
observation.  The  person-by-person  optimal  strategies  are 
given  by  likelihood-ratio  tests  at  each  time  instant  The  thres¬ 
holds  of  the  two  detectors  are  coupled  and  can  be  determined 
by  the  solution  of  a  set  of  nonlinear  algebraic  equations. 

In  [Hashemi  &  Rhodes,  1987],  a  two-step,  two-detector 
decentralized  detection  problem  has  been  considered,  in  which 
a  central  processor,  after  receiving  the  two  local  decisions  at 
time  1,  decides  whether  to  terminate  the  process  and  declare  a 
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decision  or  to  continue  to  time  2,  by  which  point  he  must 
declare  a  decision.  Local  strategies  at  both  time  instants  a  e 
shown  to  be  given  by  likelihood-ratio  tests.  The  class  of  prob¬ 
lems  treated  here  and  in  {Hashemi  &  Rhodes,  1987]  differs 
bom  that  in  [Teneketzis.  1982]  or  [Teneketzis  &  Ho.  1985], 
by  giving  to  the  fusion  center  the  responsibility  for  determin¬ 
ing  when  the  final  overall  decision  is  declared.  Unlike  in 
[Kushner  &  Pacut,  1982],  the  local  decisions  are  binary  (0  or 
1)  and  provide  the  supervisor  with  greatly  reduced  informa¬ 
tion  about  the  local  observations. 

For  several  classes  of  distributed  team  decision  problems, 
conditions  for  asymptotic  convergence  of  each  agent’s  deci¬ 
sion  sequence  and  asymptotic  agreement  (consensus)  among 
all  agents'  decisions  have  been  reported  [Tsitsiklis  &  Athans, 
1984],  [Washburn  &  Teneketzis,  1984], 

IIL  PROBLEM  STATEMENT 

3.1Jntroduction 

In  this  section,  we  pose  a  broad  class  of  distributed  detec¬ 
tion  problems  that  possess  a  sequential  nature.  Our  formula¬ 
tion  captures  elements  of  all  three  features  of  distributed  deci¬ 
sion  making  problems  that  we  listed  in  Section  I.  The 
hierarchical  structure  involves  many  local  observers  who 
communicate  only  with  the  single  supervisor  or  “fusion 
center”  and  do  not  communicate  among  themselves.  We  con¬ 
sider  a  multi-stage  problem  in  which  the  local  decision  makers 
communicate  with  the  supervisor  at  each  time  step  and  it  is 
the  supervisor  who  determines  when  the  process  is  terminated 
and  a  final  overall  decision  is  declared,  ci.  [Teneketzis,  1982], 
[Teneketzis  &  Ho,  1985],  To  incorporate  communication  con¬ 
straints  we  consider  the  extreme  case  where  at  each  time  each 
detector  makes  a  binary  decision  (0  or  1)  and  sends  only  this 
(and  not  his  observation)  to  the  supervisor.  It  is  assumed  that 
all  transmissions  are  error  free.  We  adopt  a  Bayesian 
approach  in  which  the  common  goal  of  the  local  decision 
makers  and  the  supervisor  is  to  minimize  the  expectation  of  a 
cost  functional  that  reflects  the  correctness  of  the  final  deci¬ 
sion  and  the  time  taken  to  reach  it.  Because  direct  communi¬ 
cation  between  local  decision  makers  is  prohibited,  the 
cooperation  and  indirect  interaction  among  them  is  manifested 
only  through  this  common  team  goal  [Radnor,  1962],  [Ho, 
1980], 

An  essential  pan  of  any  decentralized  decision  or  control 
problem  is  the  specification  of  the  information  pattern 
[Witscnhausen,  1971],  viz.  the  information  that  is  available  to 
each  decision  maker  at  each  time  instant  We  consider  that 
each  local  decision  maker  has  access  only  to  his  past  and 
present  observations  and  his  past  decisions  (equivalently,  to 
his  past  and  present  observations  and  to  his  past  decision 
rules,  from  which  he  can  reconstruct  his  past  decisions),  i.e. 
each  local  decision  maker,  considered  separately,  has  a  classi¬ 
cal  information  pattern.  The  supervisor  has  access  only  to  the 
past  and  present  local  decisions  communicated  to  him. 

Without  additional  structure  this  decentralized  problem  is 
NP-complete.  By  assuming  that  the  local  observations  are 
independent  when  conditioned  on  the  hypotheses,  the  problem 
can  be  shown  to  be  tractable.  In  other  words,  the  problem  is 
not  NP-complete  and  can  thus  be  solved  in  polynomial  time 
[Hashemi  &  Rhodes,  1987],  [Papadimitriou  &  Tsitsiklis, 
1982],  [Tsitsiklis  &  Athans,  1985],  [Garey  &  Johnson,  1979], 
This  independence  assumption  is  reasonable  in  many  practical 


applications  (e.g.,  in  problems  of  detecting  a  known  signal  in 
uncorrelated  noise)  although  it  will  not  be  valid  in  some  other 
situations  (e.g.,  in  problems  of  detecting  unknown  signals). 
The  additional  structure  added  to  the  problem  is  therefore  not 
too  restrictive  and  permits  explicit  computations  of  the  deci¬ 
sion  rules. 

3.2.  Problem  Statement 

Suppose  there  are  two  possible  hypotheses,  H0  and  H  lt 
with  given  a  priori  probabilities  pj=*P  (Hjj I,  j= 0,1.  There  are 
K  detectors  (control  stations),  U',...,U*,  each  operating  for  N 
time  steps.  Detector  U*  takes  an  observation  y*  at  time  n 
and,  based  on  his  present  and  past  observations  (y  *  ,...,y*)  and 
his  past  decisions  (u*,...u<^-i),  sends  a  binary  decision 
uke(0,J}  to  the  supervisor,  who  is  given  the  flexibility  of 
either  stopping  and  declaring  a  “global”  decision  or  continu¬ 
ing  to  the  next  time  step.  (He  is,  however,  forced  to  declare  a 
decision  at  or  prior  to  the  final  time.)  This  is  an  extreme  case 
of  limited  communication  between  the  local  detectors  and  the 
supervisor.  There  is  no  communication  among  the  local 
detectors. 

To  specify  the  information  pattern,  define  the  following 
sets  for  1£a£N  and  l<k  (the  following  notation  is  adopted 
from  [Witscnhausen,  1971]): 

U^=fU . n-l}  .  U u=0  (3.1a) 

Ynjc=(  1.2 . n)  (3.1b) 

so  that 

. “ti)  .  yYu-(y\ . vH)  (3.io 

Also  define 

yk  ~(yi.-,yv)=yy„  .  . >“/$)  =  “u„u(3.id) 

The  objective  is  to  find  optimal  local  strategies  for  each 
detector  (y„,  with  1  <jiSN  and  \<JcSK)  and  optimal  strategies 
for  the  supervisor  (y„,  1  £n£N)  so  as  to  minimize  the  cost 
J(D,  where 

r^C^,Yn:  IS^S/V,  \<Jc$K)-  (3.2) 

The  local  decisions  arc  determined  as 

.  *=1 . K  (3.3) 

In  other  words,  each  detector  has  perfect  recall.  Fig.  3.1 
shows  the  network  topology  for  this  problem  and  the  informa¬ 
tion  structure  of  the  k  th  detector  at  time  n .  The  global  deci¬ 
sion  is  given  by 

. K)  (3.4) 

It  is  assumed  that  the  joint  conditional  p.d.f. 
ply1  ,...,yK  \H)  is  known  a  priori.  There  is  a  cost  associated 
with  each  combination  of  the  true  state  of  nature  and  the  glo¬ 
bal  decision  (corresponding,  for  example,  to  false  alarm, 
missed  detection,  etc.).  The  cost  incurred  by  declaring  H, 
(i.e.,  u=i )  when  Hj  is  true  is  denoted  by  c,y2 0.  We  assume 
that  Cto^Cgo  and  c012c  u,  that  is,  the  cost  of  erring  is  at  least 
as  large  as  the  cost  of  no  error  when  the  same  hypothesis  is 
true.  In  addition,  there  is  an  additive  delay  cost  of  Co^O  if  the 
supervisor  postpones  his  decision  until  the  next  time  instant. 
3J  Some  Preliminary  Results 

First  we  introduce  a  crucial  independence  assumption 
which  is  used  throughout  the  remainder  of  the  paper. 
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Assumption  3.1 

Provided  the  indicated  p.d.f  s  exist,  we  assume  that 

Piy'y2 . yK\H)=p(yl\H)p(y2\H)  ■  plyK\H) 

and 

p(yk\H)  =p(y\\H)  ■  ■  ■  piyfrW) 
for  both  hypotheses  H=H  0  and  H ,  and  all  1  <Jc£K . 

As  was  mentioned  earlier,  this  assumption  is  reasonable  in 
many  practical  situations  (e.g.,  in  problems  of  detecting  a 
known  signal  in  uncorrelated  noise  and  in  circumstances 
where  the  sensors  are  geographically  dispersed).  It  results  in 
the  following  lemma  will  be  used  in  the  next  section. 

Lemma  3.1 

Let  x  and  y  be  two  random  variables  whose  joint  density 
function  is  given,  and  let  f  (xy)  be  a  function  of  x  and  y . 
Suppose  i  is  an  extraneous  random  variable  whose  joint  den¬ 
sity  function  withx  andy  is  well  defined.  Then 

E  \f  troO]  =  Er  E*  E*,  [f  (xy)]  (3.5) 

Proof.  It  is  obvious  that,  by  using  the  smoothing  property  of 
conditional  expectations,  we  have 

E  [f  Cxy)]  =  Ez,  [f  (xo0]  =  Elyj  [f  (x,y)]  =  E,  E,te  [f  (x.y)] 

We  wish  to  show  that  the  innermost  expectation  on  the  right 
side  of  this  equation  need  not  be  conditioned  on  z.  To  see 
this,  note  that 

Ex.,  [f  (x,y)l  »  E,  E*,  [f  (x,y)] 

Now,  let 

g(y)  *  E ,1,  [/  Uo')l 

Then,  after  mtroducing  the  extraneous  variable  z  and  using 
the  smoothing  property,  we  obtain 

E^,  If  (xy))  =  E,  g(y)  =  Ej  E,tt  g(y) 

which  is  what  we  intended  to  show.  □ 

One  explanation  for  Lemma  2.2  is  that  /  in  the  lemma  is 
only  a  function  of  jt  and  y ,  and  not  of  z . 


IV.  RESULTS 

In  this  section,  we  extend  the  results  of  [Hashemi  & 
Rhodes,  1987]  and  solve  the  general  multi-stage  dynamic 
hypothesis-testing  problem  posed  in  the  previous  section.  To 
find  the  local  strategies,  we  pose  the  problem  in  the  frame¬ 
work  of  a  dynamic  team  problem  and  use  the  Bayes  criterion 
to  find  the  person-by-person  optimal  (PBPO)  solutions  [Ho  & 
Chu,  1972],  (Ho,  1980],  Our  aim  is  to 

min  J(T)  -  min  E,,  J[ytYnV);  n=l,...M;  k=l....JC)i4A) 

Eq.  (4.1)  is  in  the  so-called  strategic  form  of  the  problem. 
Using  properties  of  nested  conditional  expectations  and  invok¬ 
ing  Assumption  3.1,  we  obtain 

nun  E,i  EHiyt  Ey.  ».i  JJy*(yk);  h=l,...Jf]  (4.2) 


where  we  have  defined 

J7[y*;k=l,...,/n  &  J[7(Yf;  ifUK;  *=1  (4.3a) 

and 

71  -  (7* - Yv)=Y)-m  (4.3b) 

Unfortunately,  very  little  can  be  said  about  the  globally 
optimal  solution,  and  thus  we  focus  our  attention  on  PBPO 
solutions  [Ho  &  Chu,  1972],  [Ho,  1980],  To  find,  say,  U*’s 
strategies,  fix  the  strategies  of  all  the  other  decision  makers  at 
their  optima  and  minimize  (4.2)  over  y* .  Since  the  optimum 
strategies  y1  ,j*k,  are  given,  the  decisions  u1  ,j*k,  become 
well-defined  random  variables  and  (4.2)  reduces  to  the  minim¬ 
ization  over  7*  o. 

E,»  E^  E,, J^ut,...Jt*'l,Yf(y*),u*+I . uKJf) 

This  is  a  non-trivial  function  minimization  problem  faced  by 
U* ,  but  it  can  be  reduced  to  the  following  equivalent  parame¬ 
ter  minimization  problem  by  using  the  fact  that  finding  the 
optimal  uk  for  every  y,^  and  is  equivalent  to  determin¬ 
ing  the  optimal  strategy  y*  *,  with  ISnSA/  [Ho,  1980]: 

E,t  min  E,5l>{  min  •  •  •  min  EHly. 

111  lljlH| 

where  we  have  defined 

l+u\...jiK  JH)  (4.5) 

From  (4.4)  we  can  deduce  U’s  strategy  at  the  final  time. 


Strategies  at  time  N 

The  inner  minimization  in  (4.4)  corresponds  to  the  final 
time.  Since  uk  e  {0,1},  U*  ’s  strategy  at  the  final  time  is  given 
by 

Ehi,*  i . un- t  I  t,*  ^*v(u* Mn-\  fiJf) 

’  vi- 1 


which,1  after  expanding  over  H  and  rearranging,  yields 

p(H0\yk)  (Ev<“tw  l^o)-^(“u„  .OJi o)l 
■ d=« 

£  p(H0\yk)  0-Zb(uVHi,Ul ,))  (4.6) 


The  above  describes  the  PBPO  strategy  of  Detector  U*  at 
the  final  time.  If  we  assume  that  the  expression  within  the 
brackets  on  the  left  side  of  the  above  inequality  is  positive, 
then  we  can  put  (4.6)  in  a  more  convenient  form,  viz. 


p(H  ib>yw) 


«A=o 
“*  =  1 


(4.7) 


where 

k  A  Zl(uVKi,0J10-'Lk„(uUHi,\Jl]) 
'v(UUvj)  ~  Lfttuu^Uio)  -  Z6(uUKt.OMo) 


(4.8) 


As  can  be  seen,  PBPO  local  strategies  at  time  N  are  given 
by  likelihood-ratio  tests.  We  shall  proceed  to  obtain  the  local 
strategies  for  time  n  <N . 


Strategies  at  time  n  <  N 

W;  first  prove  the  following  lemma  using  the  results  ot 
Lemmi  3.1. 
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Lemma  4.1 

Under  Assumption  3.1,  we  hove 
E[J]  =  E.. . a'M  (J^u1 . us  M)}  (4.9) 

=  ^Wtyr-  _ H  ^ 

That  is,  the  innermost  expectation  need  not  be  conditioned  on 

Proof.  After  invoking  Assumption  3.1,  the  proof  parallels  the 
proof  of  Lemma  3.1.  □ 


It  is  important  to  note  that  since  in  (4.9)  is  available 
to  U*  at  time  n  (because  U*  has  perfect  recall),  we  have 

P(“vjyr~„)  =  1 

for  the  available  u^.  As  a  result.  (4.9)  further  reduces  to 

E>.  E-Myr.  2*  <«  •*>  <4'  H>) 

where  we  have  defined 

E Z5(ukJf)  (4.11) 

Now,  since  u*  e  {0,1}  andp(u*=OI»M)  =  1  - p(u*=\\yYJ,  Eq. 
(4.10)  can  be  written  as 

E»„  I  (Pi^yrJ  EWyr_  Z^u^/1)}  (4.12) 

«f= o 


p<«  *=  1  ly  r<- )  M-Cy  ) 


+  EW  I„*(uUo,0 JO) 


where  we  have  defined 


Wyr„>  =  E«,r.  {Zk(u^,UD-zt(u^,ojr>} 

Noticing  that  the  last  term  in  (4.12)  is  constant  with  respect  to 
P(“*b'y0)*  *e  problem  for  Detector  U*  at  time  n  becomes 
one  of  minimizing  E^_  /p(u*=llyyM)  fi(yy^)jl.  Clearly,  this  is 
minimized  when 


P(un=^yyJ  = 


0 

1 


if  fi(yyM)>0 
if  fi(yy^,)<0 


or,  equivalently,  when 

“1=0 

dO-yJ  £  0  (4.13) 

«l=t 

After  expanding  p(yy0)  over  H  and  collecting  appropriate 
terms,  we  obtain 

uM .  I  MohZfo  Um  ,0/1 0)1  p(H0tyY„ ) 

“l=i 

J(  (4.14) 

The  above  describes  the  strategy  of  U*  at  time  n .  As 
before,  if  we  assume  that  the  expression  within  the  brackets  on 
the  left  side  of  the  above  inequality  is  positive,  then  the  PBPO 
strategy  for  U*  at  time  n  can  be  represented  in  a  more  con¬ 
venient  way  by  a  likelihood-ratio  test,  viz. 

*  P(H  nlyy  )  «i=o 

7  J,  *•«->  ■  WA“^15> 


where  the  threshr  Id  is  given  by 

kl  &  Zk{uu  0/1  ^ -I*(«u  A/1 ,) 
tkiu  u  )  d  — - t - - - ^ -  (4.16) 

^  S„‘(uUu,WoJ'2>u.(,Ofi0) 

(For  n=l,  tk(uv^)=tkl .) 

In  short,  under  Assumption  3.1,  PBPO  local  strategies  at 
each  time  instant  have  been  shown  to  be  described  by  thres¬ 
hold  tests  against  the  likelihood  ratio. 

Remark  4.1.  It  can  be  shown  by  natural  extensions  of 
[Hashemi  &  Rhodes,  1987]  that  each  detector’s  thresholds  at 
each  time  instant  are  coupled  with  the  other  detectors’  thres¬ 
holds  at  all  time  instants,  as  well  as  his  own  future  thresholds; 
furthermore,  they  are  parameterized  by  the  supervisor's  stra¬ 
tegy.  The  coupling  of  the  thresholds  is  indicated  below.  First 
define  the  threshold  vector 

t*-(tf(«u.,,):  l^n^N)  ,  k=l,...X  (4.17) 

Then,  we  have 

tk  =  f*(£i 4 vi  1‘ t*)  (4.18) 

or  more  compactly 

t*=f*(t‘ ti_1,t*+1 t*)  (4.19) 

In  words,  the  thresholds  are  coupled  with  each  other,  however, 
thanks  to  Assumption  3.1,  they  can  be  computed  off-line  once 
and  for  all  [by  solving  a  system  of  nonlinear  equations  of  the 
form  (4.18)].  The  functions  f*  and  I*  are  also  parametrized  by 
the  supervisor’s  strategy  y  =  . yN ). 

Remark  4.2.  It  is  interesting  to  sec  that  each  detector’s  thres¬ 
holds  at  each  time  instant  depend  on  his  choice  of  past  deci¬ 
sions;  i.e.,  for  each  combination  of  past  decisions,  there  is  a 
different  threshold  for  each  detector.  In  other  words,  U*’s 
decision  at  time  n  depends  not  only  on  yY^,  but  also  on  ujj^. 
That  is, 

=  Y*Cyr^  •  “  uj  (4'2°) 

This  leads  to  2"~l  thresholds  for  U*  at  time  n .  which  are 
denoted  tk{u  u<4).  Of  course,  once  U*’s  strategies  at  time  n 

are  given,  then  one  can  identify  a  region  in  the  y\ . 

hyperplane  over  which,  say,  u*=0  is  sent  regardless  of 
In  Fig.  4.1,  this  situation  is  depicted  at  time  2.  The  decision 
regions  at  times  1  and  2  are  shown  for  a  typical  example 
where  a  certain  monotonicity  is  assumed  (e.g.,  consider  Gaus¬ 
sian  distributions  having  the  same  standard  deviations  but  dif¬ 
ferent  means  under  H  o  and  H  {).  It  is  important  to  emphasize 
here  that  these  decision  regions  (i.e.,  the  two  regions 
((y\,y\)'M2=i},  i=0,l)  may  not  lend  themselves  to  a  thres¬ 
hold  test  To  put  it  another  way,  in  general  no  single  threshold 
can  completely  express  U*  ’s  optimal  strategy  a  tune  2.  How¬ 
ever,  if  one  divides  the  y  *  ,  y  \  plane  into  two  regions 

*.»  -{(y\A) :  Y*(y f)  =«i  J  (4-21) 

for  u *  =0,1  as  in  Fig.  4.1,  then  over  each  of  these  two  regions. 
A0  and  A),  the  decision  uj  is  decided  via  a  likelihood-ratio 
test  with  thresholds  tk(0)  and  r*(  1).  respectively.  This  is  why 
at  time  2  we  need  two  thresholds  and,  in  general,  at  time  n  we 
need  2"-1  thresholds  for  each  detector. 

Also  note  that  the  knowledge  of  y\  and  yf  is  sufficient  to 
reconstruct  u{.  However,  we  have  shown  that  U*  does  not 
need  to  remember  his  past  strategies,  but  rather  his  past 
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decisions  (which  -re  easier  to  remember  since  they  are  just 
binary  numbers).  This  simplification  came  about  because  we 
were  able  to  reduce  a  function  minimization  problem  to  an 
equivalent  parameter  minimization  problem  (4.4). 

Finally,  observe  that  when  N= 2  in  the  above  theorem,  we 
get  the  results  of  [Hashemi  &  Rhodes,  1987],  and  when  Afcl, 
we  obtain  the  strategies  for  the  data  fusion  problem  in  [Ten¬ 
ney  &  Sandell,  1981a], 

V.  CONCLUSIONS 

In  this  paper,  we  investigated  a  class  of  decentralized 
sequential  decision-making  problems.  These  problems  dealt 
with  the  observing  of  a  common  region  of  the  environment  by 
a  multiple  of  distributed  sensors  in  order,  for  example,  to 
detect  the  presence  or  absence  of  a  certain  phenomenon.  We 
have  extended  the  results  of  (Tenney  &  Sandell,  1981a],  in 
which  only  one  time  step  is  considered,  to  a  class  of  distri¬ 
buted  detection  problems  that  permit  a  sequential  gathering  of 
the  observed  data. 

It  was  shown  that,  under  the  assumption  that  all  measure¬ 
ments  are  statistically  independent  when  conditioned  on  the 
hypotheses,  each  local  strategy  at  each  time  instant  is  given  by 
a  threshold  test  on  the  likelihood  ratio.  Moreover,  it  was 
shown  that  the  choice  of  a  threshold  by  any  detector  at  each 
step  depends  on  the  sequence  of  past  decisions  made  by  that 
detector. 

Throughout  this  paper  we  have  assumed  that  there  are 
only  two  possible  hypotheses  and  that  each  local  processor 
can  send  one  of  two  decisions  (i.e.,  a  binary  message)  to  the 
central  processor.  It  should  be  pointed  out  that  this  problem 
can  be  generalized  to  include  multiple  hypotheses  (i.e.,  M-ary 
hypothesis  testing)  and  multiple  decisions  (or  multiple 
actions).  The  development  is,  of  course,  more  tedious  but 
straightforward. 


Fig.  4.1:  The  decision  regions  at  time  1  x-.d  2  for  the  ihh 
local  processor. 
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ABSTRACT 


A  class  of  decentralized  sequential  detection  problems  is  investigated.  Under 
appropriate  independence  assumptions,  it  is  shown  that  at  each  time  instant  the  optimal 
local  strategies  are  given  by  threshold  tests  on  the  likelihood  ratios.  Furthermore,  it  is 
shown  that  local  decisions  depend  not  only  on  the  present  and  past  observations,  but  on 
the  past  local  decisions  as  well.  That  is,  for  each  local  processor,  there  exists  a  different 
threshold  for  every  different  sequence  of  past  decisions.  Examples,  computational  tech¬ 
niques,  discussions  of  the  difficulties  at  hand,  and  suggestions  for  further  exploration  of 
the  problem  are  presented. 
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I.  Introduction 


The  problem  of  decentralized  detection  and  decision  making  has  been  of  increasing  interest 
in  recent  years.  This  stems  from  the  fact  that,  although  it  is  suboptimal  compared  to  centralized 
detection,  decentralized  detection  is  in  many  cases  a  more  realistic  formulation  in  practical  appli¬ 
cations  than  its  centralized  counterpart.  To  name  a  few,  it  could  greatly  decrease  the  communica¬ 
tion  bandwidth  to  transmit  raw  data  to  a  central  site,  thereby  speeding  up  the  process  and  reduc¬ 
ing  the  communication  cost.  Also,  it  may  be  the  natural  way  to  treat  the  problem  when  there  are 
multiple  detectors  of  various  nature  or  located  at  different  geographical  sites.  Furthermore, 
decentralized  detection  may  just  be  imposed  in  situations  where,  due  to  enormous  amounts  of 
available  raw  data,  centralized  processing  of  the  information  is  not  feasible.  Other  issues  include 
system  reliability  and  integrity  in  the  face  of  failures. 

Several  classes  of  decentralized  detection  problems  have  been  studied  over  the  past  few 
years.  In  [1]  a  one-stage  decentralized  hypothesis-testing  problem  has  been  examined.  Some 
variations  of  [1]  have  also  been  reported  [2—4].  The  multi-stage  extension  of  the  problem  has 
also  been  studied  for  different  classes  of  detection  problems  [5-8]. 

The  object  of  this  paper  is  the  study  of  a  more  general  and  more  complex  class  of  multi¬ 
stage  distributed  detection  problems.  In  Section  II,  we  thoroughly  investigate  a  two-step,  two- 
person  decentralized  binary  hypothesis-testing  problem  in  which  each  detector,  after  obtaining 
each  of  its  two  observations,  sends  a  binary  decision  (0  or  1)  to  the  fusion  center  or  the  “supervi¬ 
sor,”  who  is  given  the  option  of  declaring  the  final  decision  (i.e.,  deciding  which  of  two  possible 
hypotheses.  H0  or  Hx,  is  true)  at  either  stage  depending  on  the  local  decisions  that  have  been 
received.  It  is  shown  that  when  all  the  observations  are  statistically  independent  given  either 
hypothesis,  the  optimal  local  strategies  are  governed  by  Likelihood-Ratio  Tests  (LRT’s)  in  a 
manner  similar  to  centralized  detection  [9]. 

The  class  of  problems  treated  in  this  paper  is  quite  different  from  the  one  examined  in 
[7-8],  in  which  each  detector  is  given  the  flexibility  of  either  stopping  and  making  a  decision  as 
to  which  hypothesis  is  true  or  continuing  to  the  next  time  stage.  In  this  paper,  however,  this  flexi¬ 
bility  is  given  to  the  supervisor,  therefore,  local  detectors  have  no  control  over  when  to  terminate 
their  observation  processes.  Only  after  the  supervisor  has  reached  a  decision  can  each  local 
detector  be  notified  to  rest  his  process.  The  local  decisions  are  binary  decisions  (0  or  1)  which 
provide  the  supervisor  with  some  information  about  their  observations.  (Unlike  [5]  this  informa¬ 
tion  is  not  served  as  sufficient  statistics.) 

This  feature  of  the  problem  has  an  interesting  implication.  It  turns  out,  as  one  might  expect, 
that  each  detector’s  decision  at  each  time  instant  depends  not  only  on  his  past  and  present  obser¬ 
vations,  but  on  his  previous  decisions  (which  he  remembers)  as  well.  For  instance,  at  time  two, 
there  will  be  two  thresholds  for  each  detector  corresponding  to  time-1  decisions  of  0  and  1. 
Moreover,  it  is  shown  that  each  detector’s  thresholds  at  any  time  instant  are  coupled  with  the 
supervisor’s  strategy  and  the  other  detector’s  thresholds  at  all  times,  as  well  as  his  own  future 
thresholds.  They  can  be  computed  off-line  and  through  the  simultaneous  solution  of  a  system  of 
nonlinear  equations.  Because  local  detectors  are  just  sending  binary  decisions  to  the  supervisor, 
it  is  not  at  a,’  obvious,  even  under  conditional  independence  assumptions,  that  the  local  strategies 
should  be  LRT’s.  The  major  contribution  of  this  paper  is  to  show  that  the  local  strategies  arc 
LRT’s. 

The  generalization  of  the  two-step  problem  to  a  multi-stage  problem  is  straightforward  and 
is  discussed  briefly  at  the  end  of  Section  II.  Section  III  characterizes  some  of  the  important 
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properties  of  the  optimal  solution.  The  results  of  this  section  suggest  a  significant  reduction  in 
the  number  of  computations  required  to  find  the  optimal  solution.  An  example  is  presented  in 
Section  IV,  and  two  techniques  for  solving  it  are  discussed.  The  example  illustrates  the  fact  that 
the  decision  at  any  time  instant  for  each  detector  depends  on  his  past  decisions  (which  are 
remembered  by  him  as  discussed  above)  as  well  as  his  present  and  past  observations.  Finally, 
Section  V  contains  some  concluding  remarks  and  discusses  some  of  the  prospects  for  future 
research  and  the  difficulties  at  hand. 

Throughout  this  paper,  we  use  the  Bayesian  criterion  in  determining  the  optimal  rules.  For 
notational  convenience,  let  Xk  denote  the  sequence  of  measurements,  Also,  let 

-E[/(X 2)1X1]  denote  the  conditional  expectation  of  f(X f)  given  X  t.  For  conveni¬ 
ence,  we  will  use  the  notation  p(x,y\H)  for  (-x.yf//).  That  is,  we  use  the  arguments  to  denote 
different  functions.  We  will  assume  that  the  set  of  states  of  nature  contains  two  elements,  called 
Hq  and  Hi,  and  that  H  refers  to  H0  or  H  x.  Unless  otherwise  indicated,  i  and  j  take  the  two 
values  0  and  1. 


II.  Two-Stage  Decentralized  Hypothesis  Testing 


There  are  a  variety  of  ways  in  which  one  can  pose  a  decentralized  detection  problem.  A 
realistic  formulation  depends  upon  the  concrete  physical  circumstances.  In  Problem  2.1  below, 
we  investigate  one  class  of  two-stage  decentralized  detection  problems  as  a  vehicle  for  gaining 
insight  into  more  general  aspects  of  decentralized  detection.  Unlike  [5],  we  are  not  interested  in 
providing  the  coordinator  with  sufficient  statistics;  rather,  due  to  such  factors  as  communication 
constraints,  each  detector  is  asked  to  make  a  binary  decision  (0  or  1)  at  his  local  site  and  to  send 
this  binary  message  to  the  supervisor.  Unlike  [7-8J,  it  is  the  supervisor  here  that  decides  when 
the  process  should  be  terminated;  the  local  processors  have  no  control  over  the  stopping  time. 
Naturally,  we  are  interested  in  the  manner  by  which  detectors  take  the  presence  of  one  another 
and  the  supervisor  into  consideration.  We  shall  use  the  Bayes  criterion  of  optimality  in  our 
approach. 


2.1.  Problem  Statement 

There  are  two  possible  hypotheses,  H 0  and  H  j ,  with  given  a  priori  probabilities  Pj  -P  {Hj } , 
M),l.  and  two  detectors  U  and  V.  There  is  no  communication  between  the  two  detectors.  Detec¬ 
tor  U  (respectively,  V)  takes  an  observation  X  \  (respectively,  Y\)  at  time  1  and  makes  a  decision 
u  1  ( respectively,  v ;)  based  on  his  own  observation,  where  u  1  and  v !  can  take  the  two  values  0 
and  l.  The  decisions  u  \  and  v  ]  are  sent  to  the  supervisor  S,  who  either  stops  and  makes  a  deci¬ 
sion  so  or  waits  until  the  second  set  of  local  decisions  have  arrived.  In  any  case,  U  (respectively, 
V)  obtains  a  second  observation  X2  (respectively,  Y 2)  at  time  2  and,  based  on  his  two  measure¬ 
ments  and  his  past  decision,  sends  its  decision  u2  (respectively,  v^J  to  the  supervisor,  where 
again  u2and  v2  can  take  the  two  values  0  and  l.  If  the  supervisor  chose  not  to  declare  his  deci¬ 
sion  s0  at  time  l,  he  would  be  obliged  to  do  so  at  time  2  based  on  the  four  available  local  deci¬ 
sions  u^iu^uf),  and  v^vj.v^. 

The  joint  pdf  p  (X  j  X  2,T  i,Y 2\H )  is  known  a  priori.  The  cost  incurred  by  declaring  H,  ( i.e. . 
S(f=i )  when  Hj  is  true  is  denoted  by  ctJ>Q.  (We  assume  that  c  10>coo  and  co\-c  u>  that  Is,  the  cost 
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of  erring  is  at  least  as  large  as  the  cost  of  no  error.)  Besides,  there  is  an  additive  delay  cost  of 
Co>0  if  the  supervisor  chooses  to  declare  his  decision  at  time  2.1  It  is  assumed  that  at  time  n  ,Xn 
is  available  to  U,  Yn  to  V,  and  wn=(uH  ,vn)  to  S,  where  ne{l,2}.  In  addition,  «,  (respectively, 
v  {)  is  available  to  U  (respectively,  V)  at  time  2. 

Find  optimal  local  strategies  (4>i  and  <p2  for  detector  U,  \j/]  and  y2  for  detector  V)  and 
optimal  rules  for  the  supervisor  (y{  and  yf)  so  as  to  minimize  the  expected  value  of  the  overall 
system  cost  J["K4W)/f]  with  respect  to  all  present  uncertainties  (X2,  Y2,  and  H )  over  all  possible 
strategies  4>,  vf,  and y,  where  '/=(Y\,y2).  and  V=(Vi>V2)- 

Solution 

From  the  problem  statement,  it  is  understood  that  w  1=<b1(A' ,),  u2=^2(uxJX2),  v^y^y!), 
v2=V2(v i-^2)«  5i=Yi(wi.Vi),  and  Sz=y2(u2,v2).  So,  for  example,  ux  is  measurable  with  respect  to 
X[,  etc.  Note  that  each  agent,  considered  separately,  has  a  classical  information  structure.  In 
addition,  since  the  detectors  are  not  allowed  to  communicate,  the  information  structure  is  static. 
The  following  independence  assumption  is  used  throughout  this  section: 

Assumption  2.1 

Observations  X  j ,  X  2,  Y ,,  and  Y 2  are  mutually  independent  given  H0or  H  v  That  is, 

p(X2,Y2\H)  =  p(X2lH)  p(Y2\H) 

and 

p(X2\H)  - p(X  x\H)p(X2\H)  ,  p(Y2\H)=p(Yx\H)p(Y2\H)  . 


Let  the  S’s  decision  at  time  n  be  denoted  by  sn  2  Fig.  2.1  depicts  the  decision  tree  for  Prob¬ 
lem  2.1  from  the  point  of  view  of  the  supervisor  S.  We  have  {0,1,2}  and  s2&  {0,1},  where 
si=2  implies  that  S  waits  for  u2  and  v2  to  arrive.  A  moment  of  thought  reveals  that  from  the 
supervisor’s  viewpoint,  the  problem  is  totally  centralized:  he  receives  ‘ ‘observations”  w  ,=(u  ,,v  [) 
at  time  1  and,  if  he  chooses  to  wait,  w£(u2,v-£  at  time  2.  Of  course,  S’s  “observations”  depend 
on  the  decisions  made  by  the  decentralized  decision-makers. 

In  this  regard,  once  the  local  strategies  are  given,  the  decisions  w,  and  w2  become  well- 
defined  random  variables  whose  joint  conditional  density  function  p(wx,w2\H)  can  be  determined. 
Consequently,  the  problem  faced  by  S  is  centralized  and  can  be  solved  using  Fig.  2.1. 

To  find  the  local  strategies,  define  T  =  and  pose  the  problem  in  the  framework  of  a 

dynamic  team  problem  [10].  Our  aim  is  to 

min  J(D  ^  min  Ex>  J^X2).^2)).//]  (2.1a) 

=  min  Exi  E^  Enw  J^X2),^2),//]  (2.1b) 

1  In  other  words,  Jfr  <>=(,//,  )=C,,+C0'  I  [decision  reached  at  time  2),  where  I  is  the  indicator  func¬ 
tion. 

2  S o€  (0,1 1  denotes  the  decision  made  by  S  once  the  process  is  terminated,  where  So=i  indicates  that  //, 
has  been  declared.  J,  refers  to  the  decision  made  at  tim en&{12)  with  {0,1,2}  and  s2^{0.1},  where 
S  i=2  indicates  that  S  continues  to  time  2. 
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\ 


where  we  have  invoked  Assumption  2.1  in  the  innermost  expectation  and  have  defined 

Wwjn-imMOJO]  (2.1c; 

There  is  very  little  known  about  the  global  optimal  solution  to  (2.1).  We  thus  focus  our  attention 
on  person-by-person  optimal  (PBPO)  solutions  since  a  global  solution,  if  one  exists,  must  be 
PBPO.  To  this  end,  fix  y*  and  y*  at  their  optima  and  minimize  (2.1b)  over  4>.  Since  v|/*  is 
given,  v ,  and  v2  are  well-defined  random  variables,  so 

Em  J^(X2)MY2)Jf]  =  E^h  J^(X2),v2Jf]  (2.2) 

Thus,  (2.1b)  becomes 

min  min  Ex»  Effl:  Evjw  Jy[$(X2),v2,//]  (2.3a) 

=  EXl  min  EXjlXi  min  Effll  EV^H  J^u2,v2Jf)  (2.3b) 

U,  U2lui 

where  we  have  used  the  fact  that  finding  the  optimal  u2  for  each  u  t  and  X2  is  equivalent  to  deter¬ 
mining  <()2,  and  that  finding  the  optimal  Ui  for  each  Xt  is  the  same  as  determining  <pl  [11],  Eq. 
(2.3b)  is  in  the  so-called  extensive  form  [10-11], 


Strategies  at  time  2 

The  innermost  minimization  in  (2.3b)  corresponds  to  time  2.  That  is,  given  u,  and  X2  at 
time  2,  the  problem  facing  detector  U  is  to 

min  Em2  E ^  JJu2,v2Jf)  =  min  Em3  l2(u2,H)  (2.4a) 

Hjlu,  uJU] 

where  we  have  defined 

X2(«2^)=EvW  5^u2,v2,H)  (2.4b) 

Gathering  appropriate  terms  in  (2.4a)  we  get 

U2=0 

P(//01X2)[X2(«1,1^/0)-Z2(u!,0,//0)]  £  p(//iix2)  [Z2(u1,0.//I)-Z2(u1,l^/i)]  (2.5) 

«j=i 


where  we  use  the  standard  decision  theory  notation 


fwV 


<=> 


Condition  C  i  is  true  if  /  (x)>t 
Condition  C2  is  true  if  /  ( x)<t 
Either  condition  is  true  if  /  (x)=t 


Assume  that 

£2(«„0,//1)>X2(M|.lItfl) 


(2.6a) 

(2.6b) 


Then,  the  expressions  within  the  square  brackets  on  each  side  of  (2.5)  become  nonnegative,  and 
we  can  obtain  the  following  strategy  for  detector  U  at  time  2 

K(x2)%'u,("i) 

Uj=l 


(2.7a) 


where 


^  \  iO,H  i  )—^2^U  i ,  1  J~f  i ) 

tui{U  x) ~  Z£ul,lM0>-'ZziuX'OJia) 
and  the  likelihood  ratio  XUl  is  defined  as 


XUj(X2)  = 


p(Hx\X~) 


In  a  similar  manner,  we  get 

»3=0 

^v,^2)  £  fv,(Vl) 


u  t=0.1 


where 


A  -2(0. V  1 A  l)~ E;(  1  ,V  i  i) 

fvj(Vl)_  E2(1.v,//0)-s2(0.v1^/0)  ’ 


v  i=0,l 


(2.7b) 


(2.7c) 


(2.8a) 


(2.8b) 


and 

ZMy'jf)  =  Euw  J ^u2,v2Jf)  •  (2.8c) 

Note  that,  due  to  Assumption  2.1,  tUi  (respectively,  rVl)  is  independent  of  X2  (respectively,  Y2). 
Consequently,  the  local  strategies  at  time  2  are  described  by  LRT’s.  Furthermore,  it  is  interesting 
to  see  that  tUl  and  rVl  depend  on  the  past  decisions  u  ^  and  v  t,  respectively.  That  is,  there  are  two 
thresholds  for  each  detector  at  time  2,  corresponding  to  time-1  decisions  of  0  and  1,  so  that  in 
general  no  single  threshold  can  fully  describe  the  strategy  of  a  detector  at  time  2. 

It  can  also  be  shown  that  U’s  strategy  at  time  2  depends  on  V’s,  as  well  as  the  supervisor’s, 
strategies  at  time  1  and  2.  This  will  be  discussed  later.  We  have  thus  determined  the  optimal 
local  strategies  i>2  and  at  time  2  given  (4>i,Ht,Y)  and  (4>,t|/],y),  respectively. ^ 


Strategies  at  time  1 

To  find  optimal  local  strategies  at  time  1,  we  first  prove  the  following  lemma: 


Lemma  2.1 

Let  x  and  y  be  two  random  variables  whose  joint  density  function  is  given,  and  let  f  (x.y) 
be  a  function  of  x  and  y .  Suppose  z  is  an  extraneous  random  variable  whose  joint  density  func¬ 
tion  with  x  and  y  is  well  defined.  Then 

E  \f  (Jr,y)]  =  Ez  Eyij  E.^  [f  (x,y)]  '  2*  > 


Proof 

It  is  obvious  that,  by  using  nested  conditional  expectations,  we  have 

E  [f  (x,y)]  =  EZJI  \f  <x,y)}  =  Ew  [f  (x,y)]  =  Ez  Eylz  E^  [f  (r,y)| 

We  wish  to  show  that  the  innermost  expectation  need  not  be  conditioned  on  /  To  , . 
that 

3  In  fact,  as  we  shall  discuss  in  Remark  2.5,  4>i  and  can  be  replaced  by  u  ]  and  v  ■ .  rcspei. : . . 


4 


-6- 

If  (x,y)]  =  Ey  Elj>  [/  (x,y)] 

Now,  let 

gOO  =  Ejy  [f  (*o01 

Then,  after  introducing  the  extraneous  variable  z ,  we  obtain 
If  (*.>0]  =  Ey  g(y)  =  Ez  Ejhj  g(y) 
which  is  what  we  intended  to  show.  □ 

One  explanation  for  Lemma  2. 1  is  that  /  in  the  lemma  is  only  a  function  of  x  and  y ,  and 
not  of  z .  We  can  now  prove  the  following  corollary: 

Corollary  2.1 

Under  Assumption  2.1,  we  have 
E[J]  =  E[Jy  (u2,v2Jf)] 

-  Ex,  EUllx,  EWXl  EUl|Ul>w  2 'oku2 JU) 

Proof 

Again,  here,  the  innermost  expectation  need  not  be  conditioned  on  X,.  To  show  this, 
invoke  Assumption  2.1  and  properties  of  nested  expectations  to  get 

E[J]  =  EuV/  Ev,w  Ifu2,v2Jf)  =  EUiM  EuiUxJi  Zz(u2Jf) 

As  in  the  lemma,  define  the  function  g  to  be 

g(uidf)  =  EujUtM  Lziu2^) 

and  introduce  the  “extraneous”  random  variable  X  ]  to  get 

£[J]  =  EUl>w  g(uxJf)  =  Ex,  EUl>wlx,  g(u  iff)  =  Ex,  EMi1x,  Ew1Xi  g{ux,H) 

This  completes  the  proof.  □ 

Now,  expand  (2.10b)  over  u  x  to  get 

i 

Ex,  Z  P(“iKi)  Emxl  EUl|Ul^  I^(u2df)  (2.11a) 

u,=0 

=  Ex,  p(«i=01X  j)  (  E^  EUj|Bi=0i//  2^(0, u2ff)  ] 

+  Ex,  p(u  i=l  1X0  (  E//ix,  EUj|u )=1^  2^(1, u2JH)  ]  (2. 1  lb) 

=  Ex, p(u  1=11X0  Eh\x,  {  EUzjUx=\'H  2^(1, «2^/)  -  EUj|M)=0iW  2^(0, UzJf) } 

+  Etf  Eujui*oji  2^(0, u 2J1)  (2.11c) 

where  we  have  used  the  fact  that  p(u  !=01X  x)=l—p(u  1= 1 IX  0.  Denote  the  expression  within  the 
square  brackets  in  (2.11c)  by  p(X  0-  Since  the  last  term  in  (2.11c)  is  constant  with  respect  to 
p(u  jlX  0,  to  minimize  the  total  cost,  we  must  minimize 

Ex,  O(«,=llX0ft(X0J 

Now,  since  p(ui=HX0e  {0,1 },  it  is  obvious  that  to  minimize  the  above  we  must  have 


(2.10a) 

(2.10b) 


(2.  lid) 
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j  0  \i(Xl)>  0 

/>(M,=llX,)=j  J  ^(Xl)<0 
That  is,  we  have 

«i>0 

iK*i)  £  o 

«i=l 

After  expanding  \i(X  j)  over  H  and  collecting  appropriate  terms,  we  obtain 

“H) 

p(H0\XO  [Z,(l^/o>-Si(0^o)]  £ 

u,=l 

where  we  have  defined 

EuMiH  12  (U2J{) 


(2.1  le) 

(2.1  If) 

(2-1  lg) 

(2.1  lh) 


Assume  that 

(2.12a) 
(2.12b) 


I1(l,//o)>I1(0,//0) 
Z\(0*H i)  >  Zi(l,// 1) 


Then,  the  expressions  within  the  square  brackets  on  each  side  of  (2.11g)  become  nonnegative, 
and  we  can  obtain  the  following  strategy  for  detector  U  at  time  1 

KWoX'u,  (2.13a) 

“1=1 

where 


4  It(0 y/!>-Ii(l^i) 
Zx(Uioy^(OMo) 


with  the  usual  definition  for  the  LR  (Likelihood  Ratio).  Similarly,  for  V 

v,-0 

K(Y0 

v,=l 

where 

4  HKO^-EiO 

tv'  E,(1^o)-Hi(0^o) 

Hi(v  i^f)  -  EVj|yijtf  E2(y2,H)  - 


(2  14a) 

(2.14b) 

(2.14c) 


It  is  crucial  to  note  that,  because  of  the  presence  of  the  term  EuJu^  in  the  definition  of  Si 
in  (2.1  lh),  rUl  in  (2.13)  depends  on  <J>t.  However,  if  <j>|  is  fixed  at  its  optimum  $)*  (and  all  other 
strategies  are  fixed),  then  rMl(<J>i)  becomes  a  constant,  whereby  (2.13a)  lends  itself  to  a  LRT.  The 
whole  point  is  that  tUl  as  defined  by  (2. 13b)  is  functionally  independent  of  X  1(  so  that  an  off-line 
calculation  (as  explained  below  in  Remark  2.4)  is  possible. 

Consequently,  decision  rules  given  by  (2.13M2.14)  above  are  LRT’s,  and  they  define  the 
optimal  strategies  and  \j/i  for  time  1  (given  that  all  the  other  strategies  are  fixed  at  their 
optima).  We  can  now  show  that  the  right-hand  side  in  (2.7b)  depends  on  rVi,  rVj(0),  and  fUj(l);  i.e., 
U’s  thresholds  at  time  2  are  coupled  with  V’s  thresholds  at  time  1  and  2.  (A  similar  argument 
holds  for  V’s  thresholds  at  time  2.)  To  see  this,  use  the  definition  of  rUj(u  ])  in  (2.7b)  and  note  that 
in  (2.4b)  we  have,  for  example, 
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p(y2=QO\H)  =  JJ  p(Y2\H)p(Yl\H)dYidY2- 

{Y1:  KW'-puJO)  *nd  KVfetJ 

As  a  result,  we  obtain 

Wui)  =  fu2(A,.'v1(0),fVj(1))  ,  u  i=0,l  (2.15a) 

rv1(vi)  =  fv1(tu,.^(0),r14j(l))  ,  v1=0,i  (2.15b) 

It  can  also  be  shown  that  the  right-hand  side  in  (2.13b)  depends  on  fVi,  rV2(0),  rV2(l),  rUj(0),  fU2(l), 
and  tUl;  i.e.,  U’s  threshold  at  time  1  is  coupled  with  V’s  thresholds  at  time  1  and  2,  as  well  as  his 
own  thresholds  at  time  2.  (A  similar  argument  holds  for  rVi,  too.)  To  see  this,  use  the  definition  of 
tUi  in  (2. 13b)  and  It  in  (2. 1  lh).  We  then  obtain 

t u ,  =  fu,(ru,.tWj(0),/Uj(l),rVl,rVl(0),rVj(l))  (2.16a) 

'v,  =  fv , ( A , .t»2(0).tv 2(  1  ).tu , .tUl(0),rU2(  1 )) .  (2. 16b) 


Remarks 

2.1.  Note  that  fUl  itself  is  a  function  of  tUl  because  of  the  presence  of  the  term  EuJ UltH  in  the 
definition  of  l!  in  (2.1  lh).  Similarly,  fVl  is  a  function  of  rV].  So,  for  example,  if  fU2(0),  rU2(l),  rVl, 
rV2(0),  and  rV](  1 )  are  fixed  at  their  optima,  then  the  optimum  rUl  can  be  determined  by  finding  the 
fixed  point  of  fMl  in  (2.16a). 

2.2.  It  should  be  pointed  out  that  the  functions  in  (2.15)  and  (2.16)  are  parameterized  by  y.  How¬ 
ever,  once  y  is  fixed  at  its  optimum,  local  thresholds  can  be  determined  through  the  simultaneous 
solution  of  six  nonlinear  algebraic  equations  of  the  form  (2.15M2.16)  in  six  unknowns.  Since 
the  total  possible  choices  of  y  is  finite,  one  can  find  the  optimal  thresholds  through  (2. 15) — C2. 16) 
for  each  choice  of  y  and  pick  the  one(s)  that  correspond  to  the  smallest  cost.  Each  solution  will 
result  in  a  PBPO  solution  to  Problem  2.1. 

2.3.  We  have  shown  that  the  local  strategies  for  any  globally  optimum  or  PBPO  solution  must  be 
LRT’s  against  a  set  of  precomputable  thresholds.  Although  little  can  be  said  about  the  existence 
of  either  global  or  PBPO  solutions,  we  can  show  that  a  globally  optimum  (and  therefore  a 
person-by-person  optimal)  solution  exists  if  the  local  strategies  are  restricted  to  being  LRT’s 
against  thresholds.  This  globally  optimum  solution  under  this  constraint  must  either  be  globally 
optimum  in  the  absence  of  constraints  or  no  global  optimum  exists. 

To  show  the  existence  of  a  globally  optimum  solution  when  local  strategies  are  restricted  to 
LRT’s  against  thresholds,  note  that  a  LRT  of  the  form 

p(H0\X)  g> 
p(H jlX)  &  * 

is  equivalent  to  the  following  LT  (Likelihood  Test) 

u= 0 

p(HJX)  Z  x 

where  x=f/(l+f)  e  [0,1].  We  refer  to  a  solution  in  which  local  strategies  are  restricted  to  LT’s 
against  thresholds  as  a  “threshold”  solution.  Now,  fix  the  supervisor’s  strategy  y  satisfying 
assumptions  (2.6)-(2.7),  and  similar  ones  for  the  other  detector.  (These  assumptions  arc 
equivalent  to  Corollary  3. —  below.)  Since  for  such  a  y  PBPO  solutions  are  LT’s,  we  can  focus 
our  attention  on  the  existence  of  optimal  thresholds,  xMi,  etc.  Define 

L(z)  =  E{jp2,v2Jf)} 
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where  x  =  (xUl,  xUl(0),  xUj(l),  xVl,  xVj(0),  xVl(l) ).  Since  each  x  lies  in  the  interval  [0,1],  the  domain 
of  L  is  a  unit  hypercube  (which  is  a  compact  set).  The  cost  function  L  can  be  written  as  a  sum  of 
32  tenns  corresponding  to  25  possible  sequences  of  (u2,v2JT>-  Each  of  these  terms  involves 
integrals  over  regions  that  are  functions  of  the  thresholds,  x. 

Assuming  that  the  a  priori  pdfs  are  piecewise  continuous  in  the  observations,  it  is  clear 
that  these  integrals  are  jointly  continuous  in  the  thresholds.  Consequently,  L  is  jointly  continuous 
in  the  six  thresholds.  That  is,  L  is  a  continuous  function  over  a  compact  set.  Therefore,  by  the 
Weierstrauss  theorem  L  is  bounded  and  assumes  its  minimum  in  that  set.  The  minimum  point 
then  constitutes  a  “threshold”  solution.  It  is  clear  that  because  of  the  finiteness  of  the 
supervisor’s  possible  strategies,  repeating  this  process  to  exhaust  all  enumerations  of  such  y's 
yields  the  globally  optimal  threshold  solution.  This  “threshold”  solution  is  globally  optimal  if  a 
global  optimum  exists.  If  a  globally  optimal  policy  does  not  exist  but  a  PBPO  one  does,  then  this 
threshold  solution  is  a  PBPO  solution. 

The  preceding  discussions  can  be  summarized  by  the  following  theorem. 

Theorem  2.1 

Under  Assumption  2.1,  the  person-by-person  optimal  solutions  to  Problem  2.1  are  deter¬ 
mined  as  follows  : 

(a)  Optimal  local  strategies  <j>!  and  at  time  1  are  described  by  LRT  s 

if  2^0 

Ul=*l 

V2*0 

KWi)  S  'v, 

v,=l 

The  thresholds  tUy  and  fV)  must  satisfy  Eqs.  (2.13b)  and  (2.14b). 

(b)  Optimal  local  strategies  §2  and  y2  at  time  2  are  also  described  by  LRT  s 

If  2=0 

K&2)  £  •  «i=0.1 

If  2*1 
,  yJ-° 

K,(Y2)  $  /v,(v,)  ,  v1=0,l 

Vj-l 

The  thresholds  tUl(u  i)  and  rVj(v  t)  must  satisfy  Eqs.  (2.7b)  and  (2.8b). 

(c)  Supervisor’s  optimal  strategy  is  based  on  a  table  look-up: 

i  =  Yi(«  t.v i)  and  S2  =  y2(u2,v2)  (2.19) 


(2.18a) 

(2.18b) 


(2.17a) 

(2.17b) 


where  and  y2  can  be  determined  from  Fig.  2.1  if  the  local  thresholds  are  given. 


Remarks 


2.4.  Due  to  Assumption  2.1,  all  thresholds  in  (a)  and  (b)  can  be  computed  off-line.  As  was  dis¬ 
cussed  earlier,  each  detector’s  thresholds  at  time  2  are  coupled  with  the  other  detector’s  thres¬ 
holds  at  both  times,  and  each  detector’s  threshold  at  time  1  is  coupled  with  the  other  detector’s 
thresholds  at  both  times,  as  well  as  his  own  thresholds  at  time  2;  furthermore,  these  thresholds  are 
coupled  with  the  supervisor’s  optimal  strategies  y.  If  we  define  the  threshold  vectors 

(0  tu  =(t14l,tUj(0),rUj(l))  07)  tv  =(tVl,rVl(0),rVl(l))  (2.20) 

then  we  have 

0)  tu  =  fj (tv)  00  ^  =  fj (tu)  (2.2!) 

where 

fj ,  fj :  R3— >R3  .  (2.22) 

In  other  words,  U’s  and  V’s  threshold  vectors  are  coupled  and  can  be  found  by  solving  two  non¬ 
linear  algebraic  vector  equations  of  the  form  (2.21)  in  two  unknowns,  tu  and  ty . 

2.5.  We  have  shown  that,  e.g.,  U’s  decision  at  time  2  depends  not  only  on  both  his  observations, 
X  [  and  X2,  but  also  on  his  past  decision,  u  That  is, 

«2  =  <t>2(“l.X2) 


This  resulted  in  two  thresholds  for  U  at  time  2,  rHj( 0)  and  rUj(l).  Of  course,  once  U’s  strategies  at 
time  2  are  given,  then  one  can  identify  a  region  in  the  Xu  X2  plane  over  which,  say,  is  sent 
regardless  of  u  i  (Fig.  2.2).  It  is  important  to  point  out  that  this  decision  region  may  not  lend 
itself  to  a  threshold  test.  To  put  it  another  way,  in  general  no  single  threshold  can  completely 
express  U’s  optimal  strategy  at  time  2.  However,  if  one  divides  the  X  j,  X2  plane  into  two  regions 


for  u  1=0,1,  then  over  each  of  these  two  regions,  A0  and  A  lt  the  decision  u2  is  decided  via  a  LRT 
with  thresholds  rMj(0)  and  rUj(l),  respectively.  This  is  why  two  thresholds  are  needed  for  each 
detector  at  time  2. 

2.2.  Generalization 

In  this  subsection,  we  state  without  proof  the  generalization  of  Problem  2.1  to  a  multi-stage 
hypothesis-testing  problem.  The  problem  can  also  be  easily  extended  to  include  multiple  detec¬ 
tors,  hypotheses,  and/or  actions.  As  before,  we  make  the  following  independence  assumption: 

Assumption  22 

Suppose  there  are  two  detectors  each  taking  N  observations 

XN  =  {XxJ(2,...XN)  and  YN  =  (r„T2 . YN) . 

We  assume  that 

piXN \H)  =  p(X  ,1/0 P(X2\H) ...  pQCN  W)  • 
p(YN\H)=p{Yx\H)p(X2\H)  ...p(YN\ff) , 


and 


p(XN  ,Yn  I H)  =  p(XN  I H)  p(YN  I H) 
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for  all  the  hypotheses  H  in  the  set  of  states  of  nature. 

The  multiple-stage  problem  is  identical  to  Problem  2. 1 ,  except  we  allow  each  detector  to 
take  N  observations  and  send  a  decision  after  each  observation  is  made.  The  cost  incurred  by 
declaring  //,  when  Hj  is  true  is  denoted  by  ctJ .  There  is  also  a  non-negative  additive  delay  cost 
of  cn  (1  <n<N)  when  S  waits  for  decisions  un  and  v"  to  arrive.4  (In  Problem  2.1  we  had  cx= 0  and 
c2=Cq.)  Again,  each  detector  has  access  to  all  his  past  (and  present)  observations  and  past  deci¬ 
sions. 

Theorem  22 

Under  Assumption  2.2,  the  person-by-person  optimal  strategy  of  each  local  detector  at 
every  time  step  for  the  above  problem  is  described  by  a  LRT  against  a  threshold,  where  the  thres¬ 
hold  depends  on  the  choice  of  prior  decisions  of  that  local  detector  (but  not  on  those  of  other 
local  detectors).  For  instance,  the  optimal  strategy  of  Detector  U  at  time  n  is  described  by  the 
following  LRT  s: 

K(XN)  £  UK*"1)  (2.23a) 

u*=l 


u.=0 

X*(X")  £  tu.(un~l)  ,  1  <n<N 


(2.23b) 


«!=<) 
M  1=1 


(2.23c) 


where 


,  ^4  i,(«'",.oi/,>-rB(u"-,,i^I) 

)  =  - j - j - 

IB(u'-1,l^0)-I,(u"-1,0^0) 


1  <n  <N 


(2.24a.) 


. Mmllu.Ml.N(uNJD  ,  (2.24b) 

and 

ZN  ( uN  =  J fus  ,vNff)-  (2.24c) 

Note  that  for  every  detector  at  time  n  there  are  2"-1  thresholds,  each  corresponding  to  a  different 
set  of  past  decisions  (which  are  remembered  by  that  detector). 


Again,  it  can  be  shown  that  each  detector’s  thresholds  at  any  time  instant  are  coupled  with 
the  other  detector’s  thresholds  at  all  times  and  the  supervisor's  strategy,  as  well  as  his  own  future 
thresholds. 

Finally,  observe  that  when  iV=l  in  the  above  theorem,  we  get  the  strategies  for  the  data 
fusion  problem  in  [  1  J. 


4  We  are  supposing  for  completeness  that  S  is  allowed  to  reach  a  decision  at  time  0  based  solely  on  the  a 
priori  probabilities,  p o  and  p  \ . 
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III.  Characterization  of  the  properties  of  optimal  solutions 


In  this  section,  some  useful  properties  associated  with  PBPO  solutions  as  given  by 
Theorem  2. 1  are  derived.  Later  on,  we  will  see  how  these  properties  suggest  a  substantial  reduc¬ 
tion  in  the  number  of  computations  required  for  determining  the  optimal  strategies. 

Each  PBPO  solution  is  characterized  by  the  properties  that  will  shortly  follow.  The  major 
results  in  this  section  are  Theorems  3. 1-3.3,  which  suggest  a  significant  reduction  in  the  number 
of  possible  enumerations  of  the  optimal  supervisor’s  strategy  y,  thus  making  the  computation  of 
the  solutions  more  feasible.  We  assume  that  thresholds  lie  in  the  extended  positive  real  line 
[0,°°].  Consequently,  a  threshold  at  implies  that  only  one  of  the  decisions,  namely  1,  gets  sent 
irrespective  of  the  actual  observations.  We  start  out  by  proving  some  preliminary  lemmas.  In  the 
remainder  of  this  section,  the  strategies  are  assumed  to  be  PBPO  as  given  by  Theorem  2.1  (i.e., 
strategies  are  LRT’s  and  assumed  to  be  “monotonic”  in  that  local  decisions  are  identified  with 
the  hypotheses).  For  convenience,  it  is  also  assumed  that  p(ultu2IH),  p(v  1,v2l//),  and  p(H)  are 
positive.  This  latter  assumption  simplifies  the  development  that  will  follow  shortly,  but  the 
results  of  Theorems  3. 1-3.3  below  still  hold  in  the  absence  of  this  assumption. 

Lemma  3.1 

Let  tUi  =  (p  i/po)rUl.  Then,  for  Detector  U’s  PBPO  strategy  at  time  1  we  have 
P(u  X=Q\H 0)  >  _  >  p(u  i=l\H0) 

P(«i=0IWi)  "  r“‘  "  p(ui=H//,)  ’  (31) 


Proof 

First  note  that 


<=> 


p(.Xx\H  q) 
p{Xx\Hx) 


^(Pi/Po)f  -t 


(3.2) 


Using  the  PBPO  local  strategy  for  U  given  by  Eq.  (2.13),  we  have 

p{u  X=0\H q)  =  j  p(X  X\H q)  dX  j  (3.3) 

(XvKQCiyziJ 

j  p(Xx\Hx)dXx  =  TUi  p(u!=OI//,) 

while 

pfu,=UW0)=  J  p(X  X\H0)  dX ! 

{Xf.K&xysij 

Ztu,  J  p(Xx\Hx)dXx=TUl  p(ux=\\Hx)  (3.6) 

The  result  is  then  immediate.  □ 

One  way  of  interpreting  (3.1)  is  that  the  product  of  the  conditional  probabilities  of  two  true 
decisions  p(u  X=0\H 0)  p(u  i=l  I//  ])  is  at  least  as  large  as  the  product  of  the  conditional  probabilities 
of  two  wrong  decisions  p(ux^)\Hx)  p(ux=\\H0).  The  following  corollary  follows  immediately 


(3.4) 

(3.5) 
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i 

j 


\ 


from  Lemma  3. 1  and  will  be  useful  later  on. 

Corollary  3.1 

For  U’s  PBPO  decision  at  time  1,  we  have 

p(u  i=OI// q)  >  p(u  !=0)  S  p(u  ,=01// ,)  (3.7  a) 

p(u  ,=1  \H  j)  >  p(u  ,=1 )  >  p{u  1=ll//0)  (3.7b) 

Clearly,  there  are  counterparts  to  Lemma  3.1  and  Corollary  3.1  for  V.  Corollary  3.1  has  a 
very  intuitive  appeal:  it  says,  in  effect,  that  the  probability  of  making  an  error,  say,  p(u  1=0I H  {),  is 
no  larger  than  the  probability  of  declaring  a  true  decision,  say,  p{u  I=OI//0).  Note  that  in  the  pre¬ 
vious  lemma,  we  have  used  the  fact  that  local  strategies  are  given  by  LRT’s.  Now  we  can  prove 
the  following  lemma. 

Lemma  32 

For  PBPO  decisions  u ,  and  v  ,  at  time  1 ,  we  have 

(0  P(Hq\0,v  |)  >p(H 0I  l,v  ,)  (it)  p{H 0\u  i,0)  t  p(H 0\u  i,i) 

U Hi)  p(//ill,v1)>p(W1IO,v1)  (iv)  piHx\ux,\)7>p{Hx\ux,Q) 


Proof 


To  prove,  say,  inequality  (i),  invoke  the  Bayes’  rule  and  use  the  fact  that  u  j  and  v ,  are  con¬ 
ditionally  independent.  Then,  it  must  be  shown  that 


1+ 


p(u1=ll//1)p(v1l//1)pl 
p(u  i=H7f  0)p(v  \'H o)pO 


1+ 


pju  i=OI// i)p(v  \\H  ])pl 
p(u  i=OI//  0)p(v  i  \H  0)pO 


(3.8) 


and,  after  some  simplifications,  we  need  to  show  that 
pju  i=OI// q)  ^  p(ui=ll//0) 

p(u i=OI//,)  ~  p(« i=li// 1) 

But  this  is  precisely  Lemma  3. 1 .  □ 


We  are  now  in  a  position  to  prove  the  following  important  'heorem.  It  will  be  discussed 
later  how  Theorem  3. 1  and  its  companion  Theorem  3.2  enhance  our  algorithm  to  compute  the 
optimal  thresholds.  The  set  C  is  the  set  of  all  pairs  (u  i,v ,)  for  which  S  continues  to  time  2. 

Theorem  3.1 

Let  Y]  define  a  person-by-person  optimal  strategy  for  the  supervisor  at  time  1  as  given  by 
Theorem  2.1,  and  take  ij,k,l  e  {0,1}  with  i  *j .  We  have 

(a)  */ YiOVH  .  then  yx(jj)=j  provided  (jj)<&2. 

(b)  if  Yi (i,i)=j .  then  Y](k,f)=j  provided  (fc.OfC 
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Proof 

We  prove  (a)  for  /=0  and  j- 1.  The  generalization  to  other  contingencies  is  straightforward. 
So,  suppose  S  receives  (ui,v()=(0,I)  and  declares  $0=1  at  time  1.  We  will  show  that  if 
(mj.v  j)=(1. 1)  is  received  by  S,  then,  if  he  decides  to  stop  at  time  1,  he  must  declare  $q=1  again. 
Since  Yi(«1v1=01>=1,  using  Fig.  2.2  we  obtain 

c10p(//0l«iVi=Ol)  +  cup(//1lu1v1=01) 


£  CooPftfolwiVi^^  +  CoiPCtfi^iVi^1)  (3.10) 

or,  equivalently, 

(c  iQ— c oo)p(tf0luiv  l^l)  -  (c oi- 0  u)  P  iv  l^l)  (3.11) 

where  we  have  used  the  fact  that  c  10>c  n  and  c0i^c  u.  We  wish  to  show  that  the  above  inequal¬ 
ity  also  holds  true  for  ( u  ltv  i)=l  1.  It  suffices  to  show  that 

p(//0l«  iv  ,=1 1)  <p(/f0l«iv  i=0D  (3. 12a) 

p(H  jlu  jV  [=01)  ^ p(//  jIm  i v  i=l  1)  (3.12b) 

But  (3.12)  is  precisely  Lemma  3.2.  This  completes  the  proof.  □ 


In  simple  terms.  Theorem  3. 1  indicates  that  if  the  supervisor  stops  and  declares,  say,  a  0 
upon  receiving  conflicting  decisions  (0,1  or  1,0)  at  time  1,  then  he  must  also  stop  and  declare  a  0 
if  he  receives  the  nonconflicting  decisions  0,0.  This  result  is  in  part  due  to  our  “monotonicity” 
assumption  stated  in  the  beginning  of  the  section. 

Properties  of  Optimal  Solutions  at  time  2 

We  shall  now  move  on  to  time  2  and  establish  similar  properties.  The  following  lemma  is 
the  analogue  of  Lemma  3.1  for  time  2,  and  its  proof  is  similar  to  the  proof  of  Lemma  3.1. 

Lemma  3-3 

Let  TUl(u  i )  -  (p  \!p  o )tufu  [).  Then,  for  Detector  U's  PBPO  strategy  at  time  2  we  have 
p(u\,u-f=G\H q)  _  ^  ^  p(ux,u2=\\H0) 

p(uhu2=0\H i)  ~  tu*Ul  ~  p(uhu2=\\H ])  ( 

where  u  !=0,  1 . 

Lemma  3.3  can  be  interpreted  in  the  same  way  as  Lemma  3.1.  We  shall  present  withoui 
proof  the  following  corollaries  which  will  be  useful  later  on.  The  proofs  follow  by  applying 
Bayes’  rule. 


Corollary  3 2 

Under  the  conditions  of  Lemma  3.3,  we  have 
p(K2=OlMi^/o)  >  piu^HQ  >  p(u2=H«i^o) 

p(«2=0lMi^i)  ~  ‘u'U'  p(« jl//0)  "  p(U2=llu,^/|) 


(3.14) 


1 
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Corollary  33 

Under  the  conditions  of  Lemma  3.3,  given  u  i  at  time  2,  we  have  for  u2  that 

p(«2=Otu  1^/0)  p(u-fd)\u  i)  p(uz=0\u  \,H  j)  (3.15a) 

p(u2=\\u  iM i)  > p(uz=l\u  0  > p(ui=l\u  ,^/0)  (3. 15b) 

Clearly,  there  are  counterparts  to  Lemma  3.3  and  its  corollaries  for  V.  Corollary  3.3  can  be 
interpreted  in  the  same  way  as  Corollary  3.1.  Combining  Corollaries  3.1  and  3.3,  we  can  get  the 
following  corollary. 

Corollary  3.4 

For  Detector  U’s  PBPO  decisions  at  times  1  and  2,  we  have 

p(u !  u  2=001//  0)>p(u{u  2=00)  >p(«i«  2=00\H  t)  (3.16a) 

p{u\u 2=1  \\HX)  > p(u  i«2=l  1)  ^ p(u  iU2=l  1I//q)  (3.16b) 

We  can  now  state  the  following  lemma,  whose  proof  is  similar  to  the  proof  of  Lemma  3.2. 

Lemma  3.4 

For  PBPO  decisions  u2,v2  we  have 

(i)  p(H  0luj,0,v2)  >/>(// 0lu  i.l.v2) 

(Hi)  p(H !  \u ! ,  1 ,  v  2)  >  p(H  i  lu  j  ,0,  v  2) 

In  particular,  for  every  pair  (u  lfv  t),  we  have 

( v)  p(H  0lu ! , v  i  ,0,0)  >  p(H  0I«  i , v , ,//)  >  p(H  0\u  i , v , ,  1 , 1 ) 


00  p(H 0lw 2,v  j ,0)  >p(H 0lu2,v  j,l) 
0 v)  p(H ,  lu  2,v  j ,  1 ,)  >  p(H ,  lu  2, v !  ,0) 


(vi)  p(H ,IU|,v  ,,1,1)  ^ p(H \ l«  i , v , ,i J)  >  p(H 0\u , , v , ,0,0) 


where  ij  e  {0,1}  with  i  *j . 

We  are  now  in  a  position  to  establish  the  following  important  theorem,  which  is  the  analo¬ 
gue  of  Theorem  3.1  for  time  2. 

Theorem  3  2 

The  results  of  Theorem  3.1  are  also  applicable  to  y2 (u2,v2),  the  supervisor’ s  PBPO  strategy 
at  time  2.  Specifically ,  for  any  ( u  ,,v  ,)e  C  and  ijjc,l  e  {0,1}  with  i  *j ,  we  have 

(a)  if  y2(u ,  ,v ,  ,ij)=j ,  then  y2(u  i ,  v ,  JJ  )=j . 

(b)  if  y2(U\,v  \  ,i,i  )=j ,  then  y2(u  ,.v  ,,*,/  )=j . 
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Proof 

Use  a  line  of  argument  similar  to  that  in  the  proof  of  Theorem  3.1.  □ 

In  simple  terms.  Theorem  3.2  indicates  that  if  the  supervisor  declares,  say,  a  0  upon  receiv¬ 
ing  conflicting  decisions  (0,1  or  1,0)  at  time  2,  then  he  must  also  declare  a  0  if  he  receives  the 
nonconflicting  decisions  0,0.  We  can  now  prove  the  following  theorem,  which  is  another  version 
of  Theorem  3.2  but  sheds  some  more  light  on  the  structure  of  the  supervisor’s  optimal  strategy. 

Theorem  3  3 

Theorem  3.2  can  be  stated  differently  as 

Jy (u  j,v  „l,v2^0)  >  J y(u  j,V  i,0,v2,// o)  (3.17) 

with  similar  inequalities  for  v2and  for  Hx. 

Proof 

There  are  two  possible  cases: 

(A)  The  decision  pair  (ux,vx)e  C:  if  y2(u1,v1,0,v2>=0,  then  the  RHS  of  (3.17)  is  Co+Cqo  and. 
therefore,  less  than  or  equal  to  the  LHS.  But  if  y 2(u  j.v  [.O.v^l,  then  the  RHS  of  (3.17)  is 
Co+Cio;  however,  according  to  Theorem  3.2,  we  must  also  have  Y2(wi-vt*1'V2)=l,  whence 
the  LHS  is  also  cQ+c  10  resulting  in  an  equality  in  (3.17). 

(B)  The  decision  pair  (w^Vj)  ^C:  use  the  results  of  Theorem  3.1  and  an  argument  similar  to 
CASE  (A).  □ 

We  can  also  establish  the  following  important  properties  for  the  functions  defined  in 
(2.4b)  under  Assumption  2. 1  of  Se  ction  II. 

Corollary  3.5 

Assuming  all  the  strategies  are  PBPOJor  the  function  defined  in  (2.4b)  we  have 

Z2(U],\rH0)>'L2(ul,OrHo)  (3.18a) 

’L1(uxS)J1x)'Z'Ll(ux,\JHx)  (3.18b) 


Proof 

We  prove  the  first  inequality.  Using  Theorem  3.3  we  have 

I2(«i,l,//o)  =  Ev,i|W  J7(ul,v1,l,//0)  (3.19a) 

^  Ev>w  i . v  i,0 J3 o)  =  I^lO^o)  (3. 19b) 

This  completes  the  proof.  D 

In  other  words,  the  assumptions  in  (2.6)  and  (2.12)  of  Section  II  are  equivalent  to  Theorem 
3.3.  Due  to  these  assumptions,  the  thresholds  tUi,  rVi,  tufu  j),  and  fV](v !)  are  all  positive  quantities. 
It  must  be  pointed  out,  however,  that  if  we  take  J^(ux,v x,Hj)=cij  (i*j)  for  'V(ux,vx):p(ux,vx)=il 
i.e.,  for  an  impossible  occurrence  of  («i,v(),  then  we  get  rU)<0  when  p(ux= 0)=1,  and  since  the 
LR’s  are  nonnegative  quantities,  this  implies  declaring  0  for  all  values  of  Similarly,  if  we 
take  J7(u2,v2,//;)=e(rt-c1y  for  V(u2,v2):p(u2,v2)= 0,  then  we  get  rUj(M])<0  when  p(M2=0,W!)=!. 


(See  Example  4. 1  for  a  situation  where  this  behavior  is  observed.) 

We  maintained  earlier  that  the  results  of  these  theorems  would  save  a  substantial  amount  of 
computations  for  determining  the  optimal  solutions.  To  make  this  more  clear,  note  that  one  way 
of  finding  the  optimal  strategies  is  to  first  fix  the  supervisor’s  strategy  y  and  then  solve  the  system 
of  nonlinear  equations  in  (2. 15) — (2. 1 6)  or,  equivalently,  the  system  of  vector  equations  in  (2.21 ). 
Consequently,  by  exhausting  all  enumerations  of  the  supervisor’s  possible  strategies  one  can 
determine  the  optimal  solution.  The  significance  of  Theorems  3. 1-3.3  lies  in  the  fact  that  they 
significantly  reduce  the  number  of  possible  enumerations  of  optimal  y. 

In  general,  there  are  216  possibilities  since  there  are  four  decisions,  resulting  in  24  =  16  com¬ 
binations  of  local  decisions,  and  for  each  combination  the  supervisor  can  declare  either  a  0  or  a  1 . 
However,  using  a  straightforward  combinatorial  calculation,  it  can  be  shown  that  under 
Theorems  3. 1-3.2  the  number  of  possibilities  reduces  from  216  to  only  1150  possibilities,  i.e.,  a 
reduction  of  almost  two  orders  of  magnitude.  As  a  result,  the  optimal  solution  can  be  obtained 
much  faster. 


IV.  Computational  Considerations  and  Numerical  Results 


In  this  section,  we  consider  a  simple  example  of  two-step,  two-detector  binary  hypothesis 
testing,  and  discuss  two  ways  one  can  go  about  solving  this  problem.  The  example  presented 
here  deals  with  discrete  and  finite  pdfs.  Consequently,  it  has  at  least  one  globally  optimal  solu¬ 
tion.  All  the  results  of  the  last  sections  are  applicable  here  with  integral  signs  replaced  by  sum¬ 
mation  signs.  The  purpose  behind  this  example  is  to  illustrate  the  fact  that  the  local  decision  for, 
say,  detector  U  at  time  2  depends  not  only  on  the  observations  X,  and  X2  at  both  times,  but  also 
on  the  local  decision  u  |  sent  at  time  1. 

There  are  two  facets  to  the  decentralized  sequential  detection  problem  that  we  have  treated: 
time  and  space.  The  space  (hierarchy)  can  be  broken  in  two  ways  into  two  interrelated  minimiza¬ 
tions:  once  the  supervisor’s  decision  strategy  has  been  chosen,  the  problem  reduces  to  a  dynamic 
team  decision  problem.  The  decomposition  is  shown  below. 

min  min  E  [JCD1  -  (4.1) 

all  possible  all  possible 

decision  rules  local  strategies 
for  the  supervisor 

Conversely,  once  the  local  decision  rules  have  been  determined,  so  have  the  statistics  of  the 
“observations”  available  to  the  supervisor,  and  the  problem  from  his  viewpoint  becomes  a  cen¬ 
tralized  sequential  decision  problem.  The  decomposition  is  shown  below. 

min  min  E  (1(01  (4.2) 

all  possible  all  possible 

local  strategies  decision  strategies 
for  the  supervisor 

Each  of  these  formulations  adds  some  insight  into  the  problem.  In  the  former  decomposi¬ 
tion,  the  outer  minimization  is  finite  since  there  are  a  finite  number  of  decisions  u2  and  v2.  In 
general,  there  would  be  2l“,"“jllv,llvi1  possible  decision  rules  forS,  where  the  notation  \u  I  denotes 
the  cardinality  of  u  (which  was  taken  to  be  2  in  Section  II).  This  number  grows  fast  as  the 
number  of  detectors  or  the  size  of  their  decision  sets  increases.  However,  in  the  light  ol 
Theorems  3.1  and  3.2,  these  possibilities  are  reduced  by  a  great  deal. 
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Now,  once  the  S’s  strategy  has  been  selected,  it  is  a  matter  of  solving  the  system  of  non¬ 
linear  equations  in  (2. 1 5) — (2. 1 6)  or,  equivalently,  the  system  of  nonlinear  vector  equations  in 
(2.21).  After  local  strategies  have  been  determined,  the  overall  cost  can  be  computed.  This  pro¬ 
cedure  can  be  repeated  to  exhaust  all  possible  decision  rules  for  S  (taking  into  account 
Theorems  3.1— 3.2)  and  choose  the  rule(s)  which  results  in  the  smallest  expected  cost.  The  func¬ 
tions  fu  and  fv  are  very  complicated;  fortunately  (under  Assumption  2. 1  or,  more  generally, 
Assumption  2.2)  we  can  perform  this  computation  off-line. 

Let  us  now  focus  our  attention  on  discrete  and  finite  pdf’s  (as  in  Examples  4.1  below).  In 
this  case,  as  was  mentioned  earlier,  there  does  exist  at  least  one  globally  optimal  solution,  which 
is  given  by  LRT’s  and  can  be  determined  by  finding  all  PBPO  solutions  and  picking  the  one(s) 
that  results  in  the  smallest  cost  For  such  a  problem,  the  decomposition  in  (4.2)  is  superior.  If  the 
form  of  the  solution  were  not  known  for  the  two-stage  problem  of  Section  II,  all  possible  parti¬ 
tions  of  the  X and  Y  UY2  planes  (i.e.,  all  possible  values  for p(u  {\H),  p(v  ,1//),  p(u2\u  hH),  and 
p(v2lv  \JT) )  would  have  to  be  considered.  But  we  already  know  that  the  optimal  solution  has  the 
form  of  LRT’s.  A  straightforward  calculation  will  show  that  the  knowledge  of  this  fact  will 
result  in  a  reduction  in  the  number  of  possible  decision  regions  from  order  of  2n'ni  to  order  of 
n  fn 2,  where  n,  is  the  number  of  points  in  p(X,  \H).  Consequently,  the  problem  treated  in 
Section  II  does  not  fall  under  any  of  the  NP-complete  categories  discussed  in  [12]  and  can  be 
solved  in  polynomial  time. 

Hence,  for  a  problem  with  discrete  and  finite  pdf’s,  the  decomposition  in  (4.2)  suggests  the 
following  algorithm  for  finding  the  global  optimal  solution  via  exhaustive  enumeration: 

Algorithm  4.1 

A.  Find  LR’s  at  time  1  and,  for  each  detector,  sort  them  in  descending  order. 

B.  At  time  1  pick  the  region  corresponding  to  the  largest  LR,  and  send  0  over  that  region. 
(This  fixes  the  thresholds  at  time  1.) 

a.  At  time  2  compute  and  sort  LR’s  for  each  detector. 

b.  Pick  the  decision  region  corresponding  to  the  largest  LR,  and  send  0  over  this  region. 
(This  fixes  the  thresholds  at  time  2.) 

c.  Now  all  local  strategies  (i.e.,  all  local  thresholds)  are  specified.  Find  the  supervisor’s 
strategy  (by,  say,  using  Fig.  2.1)  and  compute  the  expected  cost. 

d.  Add  the  region  corresponding  to  the  next  LR  at  time  2  to  the  previous  decision 
region.  Repeat  part  c  above. 

C.  Add  the  region  corresponding  to  the  next  LR  at  time  1  and  repeat  a~d  above. 

The  regions  that  lead  to  the  smallest  cost  compose  the  optimal  decision  rule  (and  there  may 
be  several  of  them).  In  the  example  below,  we  use  Algorithm  4.1  to  determine  the  global  optimal 
strategy.  (Since  we  deal  with  discrete  p.d.f.’s,  the  possible  values  of  the  likelihood  ratios  will 
provide  lower  and  upper  limits  on  the  thresholds.)  Then,  Eqs.  (2.7b),  (2.8b),  (2.13b),  and  (2.14b) 
are  used  to  recompute  the  thresholds,  which  must  agree  with  the  optimal  decision  rules  found  ear¬ 
lier.  Assume  samples  are  equally  spaced  starting  from  point  1,  and  define  the  following  notation 
for  convenience: 

p(X\H)  =  {q , . qj  <^>  p(X=  UH)  -  ql . p(X=n\H)  =  qn 

n 

where,  obviously,  £  q;  =1. 

-H 
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Example  4.1 

Let  U’s  and  V’s  density  functions  be  as  follows: 

p(X  ,l//0)  =  {0.30,  0.40, 0.30/  p(X X\H ])  =  {0.25,  0.55,  0.20/ 

p(X2\H0 )  =  {0.40,  0.10,  0.50/  p{X2\Hx)  =  {0.35,  0.20,  0.45/ 

p(Y  X\H0)  =  fO.10,  0.30,  0.40,  0.20/  p(Yx\Hx)  =  {0.25,  0.05,  0.55,  0.15/ 

P(Y2\Hq)  =  {0.30, 0.60,  0. 10/  p(Y2\H ,)  =  f0.20,  0.45,  0.35/ 

Assume  that  Xx,  X2,  Y  x,  and  Y2  are  mutually  independent  given  H0  or  H  x.  Take 
Cojpcn^,  c0i=c  io=100,  C(j=4,  and  pcf=p  i=0.5.  Detector  U’s  LR’s  at  time  1  are  given  by 

^(XjWl.2000,  0.7273,  1.5000/ 
while  V’s  LR’s  are 

A^<7 ,)  =  (0.4000,  6.0000,  0.7273,  1.3333/  • 

Also,  U’s  LR’s  at  time  2,  xJCi) are 

X2 


*1 

while  V’s  X^fT t, Y £  are 


y  2 


After  using  the  above  algorithm,  it  turns  out  that  there  is  a  single  optimal  rule,  in  which  U 
and  V  follow  different  strategies  (obviously),  and  also  each  detector’s  strategy  at  time  2  depends 
on  his  decision  at  time  1.  The  following  conditional  probabilities  of  local  decisions  summarize 
the  local  optimal  rules: 

p(u  ,=OI//0)  =  0.6000  p(u  i=0l/7  ()  =  0.4500 

P(u2^0\ux=0Mq)  =  0.9000  p(uz=0\ux=OJ1  x)  =  0.8000 

p(u2=O\ul=\tf0)=  1.0000  p{u2=0\ux=\Jix)=  1.0000 

p(v  ,=OI//o)  =  0.5000  p(y  i=OI//  j)  =  0.2000 

p(V2= 01  v  x=OJi 0)  =  0.9600  p(v2=0lv  t=0  i)  =  0.7375 

p(v  2=0lv  ,=1  J10)  =  0.7200  p(v  2=0lv  ,=1  ,)  =  0.4469 

The  supervisor's  strategy  is  given  by  Table  4.1,  where  x  signifies  an  impossible  event. 

For  this  example  the  expected  cost  is 
E{Jy(u2,v2J/)l  =  32.61 125 
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u2v2 


00 

01 

10 

ll 

00 

0 

0 

0 

0 

U,V]  01 

0 

1 

1 

1 

10 

0 

1 

X 

X 

11 

1 

1 

1 

1 

Table  4.1 

According  to  this  rule,  U  sends  a  0  at  time  1  if  he  observes  a  1  or  a  3,  while  V  sends  a  0  if 
he  observes  a  2  or  a  4.  At  time  2,  U  sends  a  0  if  he  gets  X 2=  1  or  3  and  his  previous  decision  has 
been  «!=(),  and  he  sends  a  0  anyway  after  having  sent  u  pi.  V’s  decision  at  time  2  is  more 
interesting  because  it  depends  on  Yx  as  well  as  v,:  after  having  sent  vpO,  V  sends  v2=0  for 
( Y  (2,1),  (2,2),  (2,3),  (4,1),  or  (4,2);  on  the  other  hand,  if  he  sent  v  j=l  at  time  1,  he  would 

send  V2=0  for  ( Y  i.Y^  (3,1)  and  (3,2)  only. 

Using  the  above  local  conditional  probabilities  and  Table  4.1,  we  can  compute  the  thres¬ 
holds  from  Eqs.  (2.7b),  (2.8b),  (2.13b),  and  (2.14b).  We  obtain 

tu,=  1.0538  fv,=  1.1584 

rUj(0)  =  0.9931  rVj(0)=  1.3750 

ru,(l)  =  -0.1094  fv,(l)  =  0.6667 

As  can  be  seen,  these  computed  thresholds  agree  with  the  optimal  strategies  given  above  (which 
were  computed  by  exhausting  all  possible  enumerations  of  threshold  strategies).  Interestingly 
enough,  the  threshold  rUi(l)  is  negative,  as  it  should  be  since  at  time  2,  u^= 0  for  all  values  of  X2  if 
u  !=1  (c.f.  Table  4.1  and  the  discussion  following  Eq.  (3.19b)). 


V.  Discussion  and  Concluding  Remarks 


We  have  examined  in  detail  a  two-step,  two-detector  decentralized  hypothesis-testing  prob¬ 
lem  and  have  discussed  its  straightforward  extension  to  a  decentralized  multi-stage  sequential 
detection,  thereby  capturing  three  important  features  of  decentralized  detection  and  decision  mak¬ 
ing;  (a)  hierarchical  structure;  (b)  multistage  or  sequential  structure;  (c)  information  rate  or 
bandwidth  reduction.  We  have  successfully  employed  the  Bayes  criterion  to  find  the  PBPO  solu¬ 
tions.  According  to  each  PBPO  solution,  under  the  assumption  that  local  processors  make  obser¬ 
vations  independent  of  one  another  given  either  hypothesis,  each  local  agent’s  strategy  at  every 
time  instant  is  given  by  a  likelihood-ratio  test  (LRT).  It  was  also  illustrated  that  each  detector's 
threshold  at  each  time  instant  depends  on  his  past  decisions  (as  well  as  his  past  and  present  obser¬ 
vations).  Thus,  at  the  time  each  local  agent  computes  a  LR  (Likelihood  Ratio)  and  tests  it  against 
these  thresholds,  which  can  be  computed  off-line.  Each  detector’s  thresholds  at  a  given  time 
instant,  however,  are  coupled  with  the  other  detector’s  thresholds  at  all  times,  as  well  as  his  own 
future  thresholds. 

This  form  of  solution  is  not  at  all  obvious  in  advance.  Indeed,  one  might  think  that  the  local 
observers  at,  say,  time  1  may  well  wish  purposely  to  choose  their  decisions  in  accordance  with  a 
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non-LRT  policy  that  reveals  to  the  supervisor  more  information  (about,  say,  the  local  observers’ 
conditional  probability  of  one  of  the  hypotheses)  at  time  2.  As  we  have  shown,  this  cannot  be  the 
case.  It  is  therefore  not  unjustified  to  claim  that  the  structure  of  the  solution  that  has  been 
discovered  is  more  significant  than  the  actual  way  in  which  one  should  go  about  determining  the 
optimal  thresholds  in  a  specific  example. 

It  has  been  pointed  out  recently  in  [12]  that  a  large  class  of  distributed  detection  problems  is 
NP-compIete,  and  that  for  the  problems  falling  into  this  class  computational  considerations  leave 
no  alternative  but  to  seek  computable  suboptimum  solutions.  However,  not  all  decentralized 
detection  problems  have  been  shown  to  be  NP-complete.  Indeed,  due  to  our  independence 
assumption  (Assumption  2. 1  or,  more  generally,  Assumption  2.2  of  Section  II)  the  problems 
treated  in  this  paper  are  not  NP-complete  and  can  be  computed  in  polynomial  time.  In  fact,  we 
have  presented  explicit  computation  of  the  decision  rules  and  have  discussed  in  Section  III  how 
the  number  of  computations  is  reduced  from  an  exponential  order  to  a  polynomial  order. 

Other  issues  that  are  worthy  of  consideration  include  examining  the  problem  for  different 
information  patterns  (e.g.,  allowing  communication  among  local  agents  or  an  extension  of  the 
tandem  topology  discussed  in  [2]  to  multistage  problems)  or  for  the  case  of  conditionally  depen¬ 
dent  observations. 
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Fig.  12:  The  decision  regions  ax  time  1  and  2  for  a  typical  example.  The  strategy  at  time  2 
corresponding  to  each  region 

(GW  :#!<*!) -Ui) 

is  described  by  a  likelihood-rahon  test.  However,  the  strategy  corresponding  to  the  entire 
plane  is  in  general  not  an  LRT. 


